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1 CXMTCGAM CAAGICAGG?^ AAGCCIGCAC AGGACIGC^T AAAT?^TmA 
51 GAACAGACT3 TTCIGAACAT CAACAGAAAG TQGAAGAAOC TTAAGCIGAA 
101 OCTACAGTAT ATIATTrACA CTGAAOaaaC TIGIGTCT3G ACAAGAAAGC 
151 GCIGACAGCr GAAAT3GATC 'XMOGAACT GAGAAATCTC AAGATOSAAC 
201 GAGATGATiGA GAGCAGCAGT i^GAGAAAGTO CTOCAGATAG CIACATCAGG 
251 ATAOGAAATT GAGAAAAGGC AGGAATCAGC AGTCAATTIG CTAAICAAGA 
301 CACIGAAACT G?OWVITQC TGACAAATOG ATnTTOOGG AAAAAGAAGC 
351 T13GGAGATTA TGCTGATGAA CAOCATCOCG GAACXT^CITC CITIGG^yVTC 
401 TCnCATITA ACCnGAGTAA TOXATCATG OGCAOraGGA TCCiaOGCIT 
451 GTCCTATGCC ATOGCTTACA CAOaOCTCAT AClTlTiATA ATCATGCKX! 
501 TTOCIGIGGC AATATLATCA (nUTATTGAG TTCACTTnT ATTAAAAAGA 
551 GCXM3GAAG GAOGGICnT (GATTTATGAA AAATEAOGAG AAAAOjCATT 

601 TGGAToacaG ggaaaaatig (^AGcmrcr ttccattaca atcgagaaga 

651 TTOGAGCAAT GTCAPG:n:PC 'rrCITEATGA TTAAATAT3A ACmOCIGAA 
701 GTAATCAGAG CATrCATOOG ACTIGAAGAA AATACTOGAG AATOGTACXT 
751 CAATGGGAAC TACXTCATCA TATITCICTC TUirGGAATT ATIUITOGAC 
801 TTTCGCICCr TAAAAAITIA (3^7ITATCTIG QCTATACCAG TGGATnTCT 

851 crrAOcrocA TOGiurmr tottagigig gigatttaca agaaattcca 
901 aatacocigc cctctaccig titiggatga gagigtigga aatcigigat 
951 tgaacaagac gcitcgaatg catgiggtaa tgitaoogaa caaciutgag 
1001 agtiuigaig tgaacitgat (^tggattac a0ccaccx3ca atocigcagg 
1051 ociggatgag aaocaqocca agggcictcr tcatgagagt ogagiagaat 

1101 ATGAAGCrCA TAGIGATGAC AAGIGIGAAC CXMATACIT TGmTTCAAC 
1151 TGCD3GA03G OCnATOGAAT TOCTATGGEA GrATTIGCIT TTGYATOOCk 
1201 OCCIGAGGIC CTPOCGATCT AGAGIGAACT TAAAGATXI3G TCOOGGAGAA 
1251 AAATGGAAAC GGIGICAAAT AnTCO^TCA CGGGGATGCT TGICATGEAC 

1301 ciGcnGcoG ccrrcmoG ttacceaaoc TicmroaAG aagtigaaga 

1351 TGAATIACTT GATGCXTCACA OCAAAGIGIA TACATTAGAC ATCXXTGITC 
1401 TCATGGITCG CCIGQCAGTC CTIGIGGCAG TAACACAAAC TGIGCXCATT 
1451 GTCCTCITCX: GAATiaGIAC ATGAGIGATC AGACIGITAT TTaXAAAOG 
1501 ADCnrCAGC TGGATAOGAC ATTTOCIGAT TCCAGCIGIG dTATIGCAC 
1551 TTAATAAIGT TCIGGICATC CTIGIGCCAA CIATAAAATA GATCTKGGA 
1601 TTCATAaaaG CITCTICIGC CACTATGCIG ATmTATTC TTOCAGCAGr 
1651 TITITATCTr AAACl'IGiCA AGAAAGAAAC TTTIAGGrCA OOXAAAAGG 
1701 TOGOGGCnT AATITTCCTr GrOGTIGGAA mTIUITCAT GATIQGAAGC 
1751 ATOaCACrCA TTATAATIQ^ CIGGATTTAT GATCCIGCAA ATTOCAAGGA 
1801 TCACEAACAC AAGGAAAAAT AC {SEQ JD ND:1) 



% % 



FEATURES: 

5'UIR: 1-163 

Start Oodon: 164 

Step Godcn: 1805 

3'UIR: 1808 
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HCM3UX3CIUS PROTEINS: 

Tap BLAST Hits: 



CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



145000039337444 /alt id=gi 1 12017941 /def=^|AAG45335.1 |AF295. 
114000033649823 /altid-gi 1 10945621 /def=^|AAG24618.1 |AF298. 
160000003782430 /alt id=gi | 8677401 /def=^|AAF75589.2 |AF1736 . 
148000002720069 /altid-gi | 8248427 /def =^ |AAF74195 . 1 |AF2496 . 
87000000006802 /altid=gi | 7243145 /def =dbj |BAA92620. 1 | (AB03 . 
18000005069115 /altid-gi | 5870893 /def =ref |NP_006832 . 1 | tran. 
88000001154721 /altid^gi j 7406950 /def ^|AAF61849.1 |AF15985 . 
66000019404613 /altid=^i | 9506837 /def-ref |NP_061849 . I | amin. 
100000004435450 /altid=gi | 8926332 /def =^|AAF81797. 1 |AF2730 . 
335001098689635 /altid=gi | 11434147 /def =ref |XP_0 06635 . 1 | hy. 



Score 
975 
597 
591 
587 
578 
500 
496 
495 
492 
480 



E 
0.0 
e-169 
e-168 
e-166 
e-164 
e-140 
e-139 
e-139 
e-138 
e-134 



% 



EST: 

gi 
gi 
gi 
gi 
gi 
gi 



10934204 /dataset^dbest /taxan=96.. 
10286121 /dataset^dtest /taxDn=96 . . 
9872634 /dataset=dbest /taxcn=960 . . 
2656674 /dataset=dbest /taxm=9606 
9882497 /c^taset-dbest /taxcn=960.. 
689641 /dataset=dbest /taxon=9606 / 



1072 
718 
680 
549 
541 
525 



0.0 

0.0 

0.0 

e-154 

e-151 

e-147 



EXPRESSICN INFXDKMATICN FOR MCCXnATGRy' USE: 

library source: 

Expression inforrTBtion from BLAgP dbEST hits: 

gi 1 10934204 Whole etrbryo (nHinly head) 

gi 1 10286121 Hepatocellular carcincrra 

gi 1 9872634 Ncn-cancerous liver 

gi 1 2656674 Fetal liver spleen 

gi 1 9882497 Men cancerous liver 

gi 1 689641 Liver 



Expression information from PCR-based tissue screening panels: 
Mixed tissue {Brain, Heart, Kidney, Lung, Spleen, Testis, Leukocyte) 
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]yDEWELR^A/N 
51 KFLTNGFDGK 
101 AYTCVILFII 
151 KIGAFVSITM 
201 LIIFV5VGII 
251 LPVUHSVGN 
301 QAKGSIilDSG 
351 PIYSELKDRS 
401 AYSKVYTLDI 
451 IRHFLIAAVL 
501 LVKKETFRSP 



lEFCDESSSG 
KKLADYADEH 
MLLAVAILSL 
CKtGAMSSYL 
LPLSLLKNLC 
LSFNbTTLEMI 

veyeahscdk 
rrkm:?ivsni 
pllmvriavl 

lALMJVLVIL 
QKVGALIFLV 



ESAPDSYIRI 
HPGITSFOytS 

ysvHuucm 

FIIKYELPEV 
YLJ3YTSGFSL 
WMLTONSES 
CEPKYFVnSE 
SITO^ViyiYL 

vAviarvpiv 

VPTIKYIFGF 
VGIFFMIG3^ 



SFNLSNAIMG 
KBOGSLIYEK 
IRAFM2LEEN 
TCMVFFVSW 
SDVNFMVDYT 
RTAYAIPILV 
lAALFGYLTF 
LFPIRTSVIT 
IGASSATMLI 
ALIIIEWIYD 



QFANECflESQ 
SGILSLSYAM 
DGEKAPGWPG 
TGEWYlJ^C3SfY 
lYKKPQIPCP 
HRNPAGLDEN 
FAFVCHPEVL 
YGEVEDELLH 
UIJPKRPFSW 
FILPAVFYLK 

PF5SISKHH (SBQ ID N0:2) 



1^ 



FEATCKES: 

FuncticEial fVimins and key regions: 

[1] PDOCOOOOl PSOOOOl ASNjXYCDSYIATiaSI 
N-glycosylation site 



Number of matches: 5 



1 


83-86 


NLSN 


(SB2 ID N0:6) 


2 


260- 


263 


NLSF 


(SEQ ID rO:7) 


3 


264- 


267 


NNTL 


(SHQ ID N0:8) 


4 


276- 


279 


NMSE 


(SEQ ID NO: 9} 


5 


369- 


-372 


NISI 


(SBQ ID NO:10) 



[2] PDOC00004 PS00004 GAMP__PHOSPHO_SITE 

cfiMP- and cCSyp-dependent protein kinase phosphorylaticn site 



503-506 KKKT {SEQ ID NO: 11) 



[3] PDX00005 PS00005 PKC_FeOSPHO_SITE 
Protein kinase C phosphorylaticn site 



Nurrber of matches: 7 

1 33-35 SEK 

2 49-51 SQK 

3 129-131 lAK 

4 290-292 THR 

5 360-362 SRR 

6 473-475 TTK 

7 506-508 TFR 



[4] EDOC00006 PS00006 C!K2_EHQSPH0_SITE 
Casein kinase II phosphorylaticn site 



Number of matches: 5 

1 18-21 SSGE (SEQ ID N0:12) 

2 22-25 SAPD (SEQ ID NO: 13) 

3 129-132 TAKE {SEQ ID NO: 14) 

4 305-308 SIHD (SBQ ID ND:15) 

5 309-312 SGVE (SBQ ID N0:16) 
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PDOC00008 PS00008 MYRIOTL 
N-m/ristoylatiaQ site 

Number of natches: 6 



1 


95 


-100 


GLSYPM 


{SEO ID 


N0:17) 


2 


153 


-158 


GAFVSI 


{SEQ ID 


ISI0:18) 


3 


164 


-169 


GAMSSY 


{SEC' ID 


N0:19) 


4 


186 


-191 


GLEENT 


(SBQ ID 


ND:20) 


5 


296 


-301 


GLDE3SR2 


(SBC' ID 


m:21) 


6 


482 


-487 


GASSAT 


(SEQ ID 


ND:22) 



[6] PDOC00009 PS00009 AMIDATICN 
Amidaticn site 



58-61 DGKH (SEQ ID NO:23) 



Mentarane spanrujig structure and dcciedns: 



ilrx Begin 


End 


ScxDre Certainty 


1 


79 


99 


1 


125 


Certain 


2 


102 


122 




503 


Certain 


3 


153 


173 


1 


197 


Certain 


4 


197 


217 


1 


785 


Certain 


5 


222 


242 




123 


Certain 


6 


332 


352 


1 


240 


Certain 


7 


370 


390 




166 


Certain 


8 


414 


434 


1 


301 


Certain 


9 


453 


473 


1 


520 


Certain 


10 


476 


496 




166 


Certain 


11 


515 


535 




628 


Certain 
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12017941 

/def=^|AAG45335.l|AF295535_l (AF295535) anrmo acid 
transport system A3 [Rattus norvegicus] /org=Rattus 
norvegicus /taxcn=10116 /dataset^nraa /length=547 
Length = 547 



% \ % 

% 



Score = 975 bits (2492), Expect =0.0 

Identities = 478/547 (87%), Positives = 508/547 (92%) 

Quer/: 1 mFME]J^3S[VISriEP[IDESSSCISAPDSYIM 60 

NDP+ELR+WIEP ++S S +S Y 4<3^SEK AM SQFANED ESQKFLTNGFU^: 
Sbjct: 1 MDPIEIi?SVNIEPYEDSCSVI^IQSCYra^SE3<G^^ 60 

Queiy: 61 KKLADYADEHHPC?ITSPavISS^^ILS^^ 120 

K L DYADEHHPCI^ISFa^FNLS^M^^3SGII^^ TG++LF+INLL VAILSL 
Sbjct: 61 KTLTDYADEHHPGmR>TSSn^L£NAIM^ 120 

Query: 121 YSVHLIIiaAKK3GSLIYEtljGEKAR3^^ 180 

Sbjct: 121 YSVHUlia?\KBGGSLIYEflX23(AroWPGK^ 180 

Query: 181 IRAFM?.f™tGEWYI20JYLIIFVSVGIII^ 240 

Sbjct: 181 TP\/FTv r;r .Tw rrrom Mrwr \n .T^/^^/nTTT ,pt .qr ,t ,kmt nvr nvrcr^Fgr .ttm/ffvp^a/ 240 

Query: 241 lYKKFQIPCPIWLI^^SVGNI^FTS^^ 300 

IYKKFQIPCPLPVIXH+ Cl^ITmiJMIV+MLITO^ +NFM+DYTOR+P GLDE 
Sbjct: 241 lYKKFQIPCPIJVIIJMOJLTFT^^ 300 

Query: 301 QAKGSmDSGVEYERHSTlDKCEPKYFVF^^ 360 

A G IH SGVEYEAHS DKC+PKYFVFNSRTAYMPIL E?^CHPEVLPIYSELKDRS 
Sbjct: 301 PAAGPIKSSGVEYEfiHSODITOPKYFVFNSRT^^ 360 

Query: 361 ly^KM^rVSNISiraviLV^^ 420 

RRK^UIVSlS^ISITCMJV^raiAAL^^ D LLMVKLAVL 

Sbjct: 361 RRKMQIVSISn:SIT3y[LVM^^ 420 

Query: 421 mVIUTVPIVIJ^IRISVITLIJPKRPES^^ 480 

VAVT TVPIVIi^IRISVimaFP+RPESW++HF IAA++IAUSINVLVILVFTIKyiK5F 
Sbjct: 421 VAVTLTVPIVIi^IRISVITLIJTR^ 480 

Query: 481 IGASSAIMLJFII^AVFYI^ViaCEITKSPQKVGAL^ 540 

IGASSAIMLIFILPA FYLKLVKKE RSPQK+GAL+FLV GI FM4GSMALIIIDWIY+ 
Sbjct: 481 IGASSA™j:FIIJ>AAFYIi<LVKKEFIJ?SPQKIGAL^^ 540 



Query: 541 PEI^KHH 547 (RESIDUES OF 1-547 OF SBQ ID N0:2) 
PFN HH 

Sbjct: 541 PE1SIPCHH 547 (SEQ ID m :4) 



>CRA 1 114000033649823 /altid=gi 1 10945621 

/def=^|AAG24618.l|AR298897_l (AE298897) amiiio acid 
transporter system A [Hcmo s^iens] /org=Hcmo sapiens 
/taxan=9606 /dataset=nraa /length=506 
Length = 506 

Score = 597 bits (1522) , Expect = e-169 

Identities - 315/549 (57%) , Positives = 383/549 (69%) , G^s - 46/549 (8%) 

Query: 1 MDElviEIi^IMJIEFODESSSGESAFD---SYIRIGr^^ 57 

M E+ +1 PD4-+SSS S D SY +++AA+ S 4-A+ D E+Q FL 
Sbjct: 1 MKKAEMGKESISPDEDSSSYSSNSDFNYSY PTKQAAIJ<SHY?yjVDPENaSIFLLESN 56 
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58 LaaoO^YADEHHPGITSPGMSSIT^LSN^^ 117 

IJ2F3(K Y E HPGirSFO^ FNLSNM4GSGI]J3LSY7y^ TGf LFII+L V+I 
57 Lj3a(K---YEIEFHPGITSK3VISVFNL£^^ 113 

118 I^YSVHLIIiCD\KB3GSLIYEKLGE^^ 177 

SLYSVHLLLim EGGSL+YE+LG KAFG GK+ A SI1M:KIGAMSSYLFI+KYEL 
114 FSLYSVHT J J .KT?\NEa^Il.YBQlCTKARXVGKI^^ 173 



Query: 178 PEVIRAFMGLEENTCEWYIJ^GI^ 237 

P VI +A +E+ TG WYiraSIY]>+ VS+ +ILPISL +NIjGYIjGYTSG SL aWFF+ 
Sbjct: 174 PLVIQALTNIEDKmWYLi^t3ra.VLiVSL 233 

Query: 238 SVVIYKKI^IPCPLPVIXHSVCl^IWII^^ 297 

WI f3CR>PCP+ + N + N TL ++P 
Sbjct: 234 rVVICKKFUVPCFVEAA--LIIl«"INm 269 

Query: 298 DEHSAirSIiTOSGVEYEAHSroKCEPKYFVFNSRTA^ 357 
+ + +D C P YF+FNS+T YA+PIL+F+FVCHP VLPIY ELK 

Sbjct: 270 alshnvtembcrphyfifnsqtvyavpiij:es 

Query: 358 DRSRRKNX?IVSIOTSria^WtrLl^^ 417 

DRSRR4M VS IS M -fWYIlAALFGYLTFY VE ELm YS + DI LLrfVRL 
Sbjct: 318 DRSRRR^WIVSKISIT7WIMYL]y^^ 377 

Query: 418 AVLVAVIOTVPIVLFPIRTSVITIJ^ 477 

AVL+AVT TVP+V+FPIR+SV LL + ESW RH LI ++A N+LVI VPn+ I 
Sbjct: 378 AVimVTLTVPVVIFPIRSSVTHLLa^KDFSiA^^ 437 

Query: 478 PGFIGASSATMLIFILPAVFYLKLVKKEnTFRSPQKV^^ 537 

FGFIGAS+A4MLIFILP+ FY+KLVKKE +S QK4GAL FT> Gf M GSMALI++DW 
Sbjct: 438 P3FIGASAASMLIFILPSAFYIKLVKKEIM<SVQKIGALE™^^ 497 

Query: 538 lYDPPNSKH 546 {RESIDUES OF 1-546 OF SBQ ID N0:2) 
+++ P H 

Sbjct: 498 VHNAPOGC^I 506 (SB2 ID NO :5) 



antner search result:s (Pfam) : 

Model Descriptico Score E-value N 

PF01490 Transmenforane amino acid transporter protein 187.0 2.9e-52 2 

CE00398 E00398 CD53 4.0 4.8 1 



Parsed for dcmains: 



^kxi^l 


Domain 


seq-f 


seq-t 


hnm-f hrrm-t 


score 


E-value 


CE00398 


1/1 


90 


110 . 


1 23 [. 


4.0 


4.8 


PF01490 


1/2 


99 


236 . 


1 179 [. 


58.9 


2.5e-14 


PF01490 


2/2 


305 


529 . 


200 467 .] 


133.9 


3e-36 
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1 AGCTTAGCAA TATO^TCAA GAOCJrCGAAT MXH^TmA TAAAAC^TITC 
51 AGGAGTAAAC AAAGOOGAAG AAATAGmT TTTAAATAGT AGAAL'iTl'i'i' 
101 TTATITTTAG AAAATtTICTC TTCTATAGAA GAAAGACAAG OCrmGAIT 
151 GGGOCCTCIG GMtXTCAOT ATTGATGAATT TTAAAAGOGA CTCACATCrA 
201 GTCAOJKXJT GATGAAAQC¥\ TAM3ATAAA AATTOGAAA TCCICAGAAA 
251 ACCAT03ATA AATTATCTAT AA^^GAAAIAA GAGO^^GACT CATCAATMA 
301 AGCIAGAAGA GACPAGTITC TTCAATATTC TGAAGGAAAA TGCITCIGAA 
351 TCTA3AATTC AAACAATTAA CAAAGTITGA AGGGAAAATA AAGAATTITC 
401 GAACATGAAG GAACTCAGAA ATICTATTIA GAGACATAGG CTCATIOIGr 
451 GAAAAAAGTT ATICA/^SGGA TTATTITAGC ATAATGGAAA ATAAACIGAA 
501 GAAAGAAGAT AGAATGCOGT TGAAGAAACT AGGAGCIX^ CAAGACTGAG 
551 AGGirGGAGG AGGAAGCTAT TGAGAATGAG AAAGAGCATA GAAAATTIGC 
601 TTTGAAAGIT TTOGTAATAT AGAATIATAT TTCAOTATT ATGIAGrGAA 
651 ATAGACCACr TTGTCnTAG GGGATACTAT TTATAGAGTC ATAATACIGT 
701 AATIGCIGCr TATIGGmT OGAIUnTAG AAACAAOCIA CAGGCAAGIT 
751 ATGAGACTPG TTTCAGAGAA CAAGATGAAA ATATIATGAT TCTCAAATIG 
801 TAAAAGEATT TTATTAACEA AAATAATIAG GAGrGTAOGA GAAGGAAQGA 
851 AAGAAAGAAA AAGIATGCTA ATUrOCITAT 'ITITIATOGG TAACCAGIUT 
901 AAAATCAGTA AAOGAAGTCA AAAAAQCriT AGIGAATTAT TGAGATCEAG 
951 AATOGCEAAC TTEAAGTAAC AAGCTAAAAA GAGAAACOGT CAATAGIGGT 
1001 TOCIGCIQGG AAGIGAGACT GGEACTGIGT GAAGAATGAG GAAAACCTTT 
1051 GTACrGATIT AGIGAGITrC I'i'i'l'i'i'i'iTl' CnTEACOCA TATGCATGTC 
1101 TTAL'l'lC'iAT TGIUICTTAG Cl'i'l'iAAOCT GCXTCmTC ATClTi'lATG 
1151 TATAEAGATT TAGGCIGOGT TATATIAATA AmTTITGAT TITIGITOCr 
1201 OGIGCTTAAA ACACICTGIG CTAi'l'i'i'l'i'i' AAATICIGAG AACIGCITIG 
1251 TTEAmurA GAGAATTGIC TGOGATTATC TCmCIGIT TICTCrCACC 
1301 CIAGrcrCAC AATTCrCTAT ATIGGAATGA GEATGAGIGT ATATTTGAAC 
1351 TTGEAATIUr TATTTnTOC CGAITOCrcr TAACirCITA TTIGXATTTr 
1401 TCirrTTITA ATCTCITGAT GCEATAATTT GAGrGATTTC CAGAGATCIG 
1451 TGirrGAATT TTATAAGICT TQCITCAGCr GAGiTlTi'lT AAATITCAAT 
1501 GATTCTAnT TmCITnT TTEAAGAATT OGITmTIG ACICTTITIG 
1551 GAAGAGOCIG TTCTCCmT ATATTCCnT ATAATGmT TATICIGIGA 
1601 AAGITATICr CTEATnTGA ATGmTCIT TCAAAATGIG TTTCTmTA 
1651 TTAAnTAAT GTAAAAGTOC Cl'lTJAAATT GCITIGnAT TIGIAGITOC 
1701 TTAGAOGIGA ATTITATCAT TTCTTGItXX: TACIGGCACT CTIGCIAGIG 
1751 AGirrCGATG TGIGTICIAT ATGnTIGIA ATnGAGGAT GK^^CiTlT 
1801 CTCAAGIGIG ACTTIGOGTIT GAAAAAAGEA CIGOGATOGC ACrGGGTIGT 
1851 GGAOGIAITC CGATGIQGEA GTrrCIGTIT GTGAGAGGAA TAGCAGATIT 
1901 TGrGACITCr OGAGGAATTT TTAIGITAGr TTCTCIGCTC AAGATITOGr 
1951 TATCAAAIGG CjEATIGCAGA TCTCATGAOC ACACl'i'i'lGA AGAATGATAG 
2001 TCTTTCracr AATAOGATOG irCAAGAAm ATIGAATGAA TCEAATGGEA 
2051 AGAAITTCAG AAGAAATEAT ATGAACTACA TATAGTAGAT TCAAGGGAIT 
2101 TTTGAAAAAC AGAATGOCAG TCCAOCOCIT TTGACTATAC AATIGAGGAA 
2151 AATGAGGTCC CGAAATGTIA AATGACnCT GCIGAGATOC AATGAATEAA 
2201 AGGCAGAGGA GAGGCEAAAA TGEAGATCTC Ti'lUriUriA AAATACATTT 
2251 lAATITGAGA CAGATGAra\ GrAATGCIGA OCGAGAGGIA AATCIGAACT 
2301 TTCrmGIT ACTATTCrm ACITIGGCIT GAGGATGCAA GIGOGTAGAA 

2351 AGrmcrrac taaactigat cctgaogiat gtigcatatt atcaagcait 

2401 IGGIGGIGIT AATTCnTGA TGICCAATTA AATTAAAGCA GIAAmTCT 
2451 TTCIAGITAT TGCrAGTAGA GAGACIGGTA GATTGrGOGT IGGIAGAOCT 
2501 TCCICICTCA ACAATTTACr TTIGrGITaC TITCmTAA AAGATGTATC 
2551 GCAGTCACAA ATACCIAAAT TrOGTIGAAG ACIGCIGOGA TGi'i'i'iAAGA 

2601 TiTCrrnrr TxrocATAGr gactageaaa acgigocatt ttcaitatac 

2651 ATAOGCACTC TATAAATATC TGCEAATTEA GCAATTATrA GXAAnTOCT 

2701 TTdTcrcrr ccATnunc crrixji'iuiA ttoggiaaag gaacattica 

2751 GGATTTGCrr AlCTAAAGIT TTGAaGAlGIT TCnTOCITC CrOOGmTA 
2801 GAGAGAGGAT AGAAAATGEA GATGATTGAT ATTGACTIAT TTCAITrAAA 
2851 TAAAATTAXA ATGATUIATG TIGHGITCIG TITGGAGAAC AGAGICTICT 
2901 GAAGATGAAC AGAAAGIGGA AGAACGITAA GCIGAAGGIA CAGIATATIA 
2951 TTTAGAGIGA AGGaacnOr GIOIGGAGAA GAAAGOGGIG ACAGCTGAAA 
3001 TGGATOGCAT GGAAGIGAGA AATGICAAGA TOGAAGCAGA TGATGAGAGC 
3051 AGGAGIGGAG AAACJIGCTO: AGATAQGEAG ATa3aGATAG GAAATTCAGA 
3101 AAAGGGAGCA ATGAGCAGGT ATa3GGITAA AAATEACTAT GITOCATGGA 



FIGURE 3A 




3151 



AAAATAAG?^C NJ^TGVGCA CATGGAAAAC 




Docket No.: CL001 01 0 
Serial No.: 09/776,705 
Inventor: Karl GUEGLER et al. 
Title: ISOLATED HUMAN TRANSPORTER PROTEINS... 



AGGCTIUriGA 



3201 TOGATTmiT ACAGCJCAAAT TIGTGATAAC AATCATATIG ATGCIAGCAC 
3251 ATCAATTQOC TOC?T0Cr3AA ATACAGIGAT AATCTCAATC TCTTTIGIGA 
3301 CIGATTTAGA ATIGAGCJEIA CAAICTCTIT GTCTCCATTA ATAATGICTA 
3351 ATAATTITAA TIATTnAQC CIATTCCTCC TCTIATCnT CTCAGATIOC 
3401 TCTTIGAATC TTGCTACAOC TOCIOCTTK: TGrAGGO^lT CrmCTCTC 
3451 TAAAAGIATC CrCTGOGCAA GCICACTGAC AACTACTAIG GOCTCAOOCT 
3501 OCAAATATAT GOCATATAOC CAGCCKTITA AGTITCTCTA CTGAATTICA 
3551 GATAATIATA TCIGAATCTC TACToGACOT CTCIACTOGA CCATIACIGT 
3601 GIUEAAATTG OCTCATTEAT AAAGTIAAAC CTCTAATUTC TAATACIG^^ 
3651 CrOCTATUTT TOOCTOCAAA ACCIGCTGCr 0CTCTA7IAA TOOOCATOCT 
3701 PGT&J^TC ACIGCCATCA TCTAGGAACT GACTCAAAAG OOOCTAOCirc 
3751 TAAACTTIGA OOCACATAGC GAACX3CTCAG TCATATCGAG TTOTrnGAC 
3801 CITATTAAT3 CTICAAATAC AOCTACmT CIGIACCCAT TCIACICT3G 
3851 TCriAa?ITA OGCCEACATT AAATGIGAGA CAaOGAGA:^ GOOCTGATTT 
3901 CTCTCOCICT CITACATnT GCKTOCTCT GTCTAGCCCT CTAGACrOCT 
3951 GCAA3AGCAA TCIUITACAA TIGCAAATTG AATCAATIT2 CATOCTTAGA 
4001 TAAAGOOCIT iITOCAOaCr CCAATAGOCA TAAGA3AAA5 TAGATEACAC 
4051 ACACIGCIGG HCACXUmOG TCCmGIGA TUIGITdT^ ACCIGGCXrT 

4101 criurociCT tttttgocct croccrATrr GriAcriGrr GCcrrcAcrc 

4151 ATTCIGCroC AACIGCCT3G AATCAGTCAC CIGCKXTDi: TTTCTCrnTG 
4201 TIGACAOCTC TCATCCITCA AGAATCAGCT GAACATCAGG TCTOCIATGC 

4251 AGGcnrrcc aaattactct acvcccccat gtagaagiga ciGoaxTO: 

4301 TTCATGEAOC CIUICOCTGr GGAGATGITA ATTAOjOCAC TACmCAOGr 
4351 TAATOGOnC TCT3GiaXA CCACTTIGOCA OVnUTCIGG TGGATAGIGA 
4401 GIGGACAATA GTTATTIGAT AAGTGAATTG AinOCCACA AAAIGITATA 
4451 TCAAATTGrA CATGATTIAA GATGCIGAi:^ AGGGAATTIT IGAOGAAATC 
4501 TAOOaGIGAA ATAGAI3^TA TIGIGCTCAA ACAAAGACIT CTCAI'ITIAT 
4551 TTACAACAOC CAGGAAAATC CATCAOGAGA AACrAOOGIT CTIUITICAA 
4601 GEAGCTGAGT QGAATGAACT TTAGOGATGr CGGACIAGAG AGGCTACIGA 
4651 GATGIAAA1T ATAGCAmT CTAAATIAaG IGACQCTIGA AGAAACACTA 
4701 OGGIGCEAGA AGACAGGGCT TTGSAGTCIG CAGAGIAGIT GOCIGACITT 
4751 AGAGAAGCIG TTIGTOCTCr TIGAGCTICA AIGGAAAAIG TAAAATGQCA 
4801 AAGGAAGAGC TGCmTGAA GGAIGAGATG GGIGAOCAGA ATATAGAIGA 
4851 CATTCAAIAC TriTi'lATEA CTICTOCTrC ACIGGATTAC CCICAGIAAA 
4901 TIGATICAAA OCTGAOGATG TTTGIGAAAG GCAT3CACAC AAAEAIGAGC 
4951 TCIGCOGAGG TTGAGAGAGT TAAAGOaGAC AOOCTGGIAA GAACIGICAT 
5001 AGIGTCAITC GACITGAT02 TGAAAAGCXA GAGTAGAAAG AGCATCAAIG 
5051 CrmCTTAA GCITCArGCA ATGIGITGCG AAOCACTCAC AGIGACITAC 
5101 CrmATCTC CIGGCTIAAA GATAOGACAT GATTTIGGAG TmTAAAAT 
5151 GAGITIAAAG AGATOGGTIT TATCEA3GIG laTTTIGGRT TGAACOCITA 
5201 AATGIAAATT TTIGACSy^ TCAAGATAAT GTATTIATTr GIGATCATIA 
5251 lACrrGIGIT TTCAAIACAT GCIGGGi'l'iG GIATCAAAAC ATTEAACAIA 
5301 CraaOGACAT TICTCATCrA 'I'i'i'iATACAA TCTIGGCAIG ITAAATGACr 
5351 ACAACTCATC TGAIGOGAAA ATAAGAACAT GCAAATGOGT CAAAGAAAGA 
5401 AAATUIGnr ACrCTCAAAT ICTCAATnT AAAAAdACT AIGGAATAGA 
5451 GATTITAGIT TATIGAITAA AATAAAGATT CGAGAGITIA AATTCTAGGr 
5501 OGGAL'i'i'ilG 'i'l'lTiATAGT CCIGAOaOOC Al'l'i'IAGGCT TCAl'lTiATC 
5551 CIGTCATCrC AGICTOCAAC IGIGAAGATT AIUIACGAGT CTTCACATAG 
5601 CAGGEAGATT AATIACAGAC CATTAATGIA AAOCACAAAA GAGIGGIGOG 
5651 CAGIGGGIGG GGOGIGAATG GAAAT3GAAA GAGGCAAGAA CIGAGQGGAT 
5701 TGIGCmcr GIGAGAAATA TGOGGAGAAG GdAQGAAAT GITCnAACr 
5751 IGIGTACTCA GAGCIATTIA TGOCnGAGT TCIAGAAAAG CACATACAAC 
5801 TnUIGGnT OGIGIGCIGT TTCIATCIAC AICTGATACT GmTCIAIT 
5851 CrCAAAAAGT AAaxnGTGA TOCrCTTTCX: TCrOGAGATr ATnTCAGGA 
5901 TTAQCnUIG TIAIAAAAAA TAGCTIGIAC AGATCTOCIA GAAIAATIAT 
5951 TTICIATnT ATTIUIAAGG TTEATTrATT mTTIATIGA GAGAGAGAGA 
6001 GITICACrCr TGIOXCCAT GCIGGAGIGC AAT3GIGCAA TCTCGGCIGA 
6051 CIGGAAGCTC TGGCTOCGAG GnGAAGOGA TTCrOCIGCr TCAGOCTCCT 
6101 GAGIAGCIGG GATIAGAGGC GCCIGOGAOC AGACTCGGCT AACmTlGT 
6151 ATTIUTAGIA GAGACGAAGT TTGACCATGr TGOOGAOacr GGIGTIGAAC 
6201 TOCIGAOGIG AAGTEATOCA OOGAOCKAG OCTOOCAAAG TGCIGGGATT 
6251 ACAOGOGIGA GOCACTGIGC CIGGOCTCIA QGATIATATr AATAGAACAA 
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'^-•'^QM^^S^^Ol TCrrGAATTA TTITATCnT CrrrATCTIT CmTGATGr AGGAAATGIC 

6351 CIAAAATnT GAAAOOCTCA ATTIGAAAGC ALTITiAAAA TGATACATAG <V> 
6401 TCGAGGATTI TATATAAAAA CAACTAAAAA GTCIGIGACA TTnGCAGIA '^'^Jj 
6451 TAAAAATCGA AT33GfiGGAG CAOGOCTTAT TAATIGAQOC TCnOGAAAT - 
6501 GIGGCraCTC CIWJKUJT AGCUTGAAAG aOGCIQGCIT GTAACTGCAG "^-^ 



6551 GAGCTGACX^A GCACAGCTCT ATAACGAAGI TGIAGATCIT CEAGCCICTG 
6601 TOCAAGAAAA CGAGAATCAC AAOGCICICT OGATACTGAC ATCTTAAAGr 
6651 TTTCmOOC TCOGAACrCT TITGGCAGIT GATTGAATIG CTTEAATAAT 
6701 TTOCITAGIT TCATTGATIA TCIGITAATA ATOCATCTAC ATTITGAGAG 
6751 TAATTAAAAC ACATACGGAC AGACAGAAAC AAOGAAGAGA AGACAGAGCT 
6801 ACGACIGAAT TACTITCCAG TAAGAGATGT ATGTATAAAT GATIGTACCA 
6851 AAAAAAAAAA AAGAAAGAAA ATACGAGCTA GAGGGCCCrG CCraOGACrG 
6901 CITGATGGCA aGQGGAGAAT GQGGrcrCGC OCIGGGTATG QGIGGGrATG 
6951 OGCCIGCIGC TTGACCnTC TGAGOGAGAG TTOCCTATAG OGATATTriG 
7001 AACATCAGAT GAGATAAGGA TCACAGIGOC TAGGGATTTA ATAAATATTC 
7051 GirGAATTAA TAAAATGATC TGATTATOGT ATGGEAsJEAG TTGAGAAAAT 
7101 ICTGrCATAA OCCIGEACIC Ti'iL'l'riQGA AOGaCrCIAA ATGGGAAGAC 
7151 AATTAGTICT AGICTCITGC ATAGCIAATG TGAGAAAGAG GGAATGTGGr 
7201 ATAAACAATT TTITAACrAA AAATAATATT TCCITCCnT ATAAGATGCT 
7251 TGITOGATOC GAAAGIATAG TTGTAAATGG AACTGAAAAT TGirGGTCTG 
7301 GAATGAGOGT TAGIGIGAAG GAQGAAAAGA AAATIGGGGT GTCITATTrC 
7351 OCXJCOGICTG ATTGAGITAC TTAGATGACC TGAAAGATAC ATATGATTGA 
7401 GAGCATATAT TTAGATGITr TGACITIdT ATnGTGIGr GIUICTGITC 
7451 AGTGAATTIG CEAATGAAGA CACTGAAAGT CAGAAATIOC TGACAAAT3G 
7501 ATTmOGGG AAAAAGAAGC TGGCAGATTA TGCIGA1GAA CAOGTAAGIG 
7551 AATCTAIGCT TTGAOOGAAT AAAOaOGACr GAaOGIUIUr GATCrAOCIA 
7601 GGTCrCIGIG GGAAAAGAAT GIGACIGAAA TTTTOGAAGC CITGATGAGC 
7651 AGATTCIGIG TnATrCAGG CTCITACiaG AATAAGQGCT IGmnTOC 
7701 TGnOGCCAT ATQGCTGCAT GAATCATITA TGAAACTIAT GIUi'l'l'iOGG 
7751 GQGAAATGAT TCEAAGCGAA AQCTAATCTA GAATGATAGA TJi'lTiCOCT 
7801 TCnTATGIG ACICOOCTIG TAATXTGEAT TITEACIGAG GCCICTGCIG 
7851 AAACGAAGCA CIGGATTOaG TTGAAAATTA CATGCTTITA TTS^IGTIGA 
7901 GEAATOGCTT mCTOCIGIA ATGITATCTr AC^ICTTCAAT TTIQGACIGT 
7951 AATCIGCAGA TAAIGIGAGA ATAAOGATAA GOCCIAAAGG TATGCOGITT 
8001 aaCAAATGIT TGCmTAAT ACATOCmC Ti'lTiCAAGC ATOOGGGAAC 
8051 GACrnXTIT OGAATGrcrr CATTIAACCr GAGTAATGCC ATGAIGGQGA 
8101 GiaOGATiOCr GGGCnGTOC TATGCGAT3G CCAAGACAGG GATCATACIT 
8151 TTEATGIAAG TGAAIGIATA TGICTAGATT T3GIGATGAA GTOGATGGAT 
8201 AOCTGGTQGC 'I'i'l'l'iCAATT AACAATCTCA AGITIGATGr TTGIGAAOGT 
8251 GAAGACTGAG AGGAGGCTAA TGATGOGACT TSGTCACOGA AOGATCOCTA 
8301 ACOCAACGGC AGAAAGIGTA TGTGCICAAT CAAOGAAAGT GCTOGAGGAG 
8351 OGTOGCGAGA AGAATITIGr TATTGAGEAA ATACTTGAAA TAATnOGIG 
8401 TTTAGGAAOC AAAAAGATCT TTOOCAGAAG GAAATGTGAT TTTATCTCAT 
8451 TdTAGGAAA GAAGGAACCA AGOCTAAGAG CXXTGCATGC CCTIGOCIAC 
8501 CTEAIGTOCC ATTCOCIGEA 00CCICT3CG ACAGATACAC TOGGCAGAAT 
8551 AGOCITCrcr CGATCCEATG AAGATGCCAC AHTlTOCrCTC ACXATIGGAC 
8601 CITIGCACAT GGICnaGAA COirrCTICrG TTOCrrCITC ATCTAGITAA 
8651 CrOCTCATAT GTCAiljITCAG TCTGAOCrGA ATACIGOGCG COCIGATCTC 
8701 CATGACIGGG GCAAATGAOC TTATGATAAC ACTCAOCACA ATTTTAATGr 
8751 TTIAGrGOGA TTTGIUIGAT TCATnGGIT AATATCIGTC CCKTIGCIG 
8801 GACEATAAGC TCIAGAAACT 1GAGO0GATG TCIGmTEA CTGADCAAIG 
8851 TCICEACCrC GAAAOCIT^ GGAGTGCCIG GTACAGGGAA TATTIGriGA 
8901 GIGACCAAAC CTXATTCCEA AACCTAOGrA CTITCACCAA ACTIGITCAA 
8951 ATCCraXTA AOGGEAGCAG GATCIGGIAG TTGAOCTGTA OGGIGGATAC 
9001 TGGACIGTCr ATGACAGACA AGAAGAGACG TTrAIGIGGA TGAIGTAGAG 
9051 CCIGGCAnT TGCAGGATAT AGTIGGCAGC AGIGGAATTC TICACAAGAA 
9101 1AAACTCIGA TGrEAOGGAC GACIGIGGAC AGAGATGCTA ATOGCAAATG 
9151 GAAGGCEAGA GAGTTAAA'm ACIGICEAAG AATGCAAGAT TTATATCACA 
9201 AATATGIGCT G'i'i'iATGnC TGAATATGAC ATATGATEAG TAATGAGAGA 
9251 GCTATriGAG GGCEAAGCAT CAOGACrATA AATATTIGIA TTGIGTIAGT 
9301 GCITIGAriG AACTCITITA TGEATAATAT TCTTCAGCIG AAiaQb'lTi'i' 
9351 mTATCAACT TTALTlTiAT ATAAGCCATG TmGAAATA AACTAGGAIT 
9401 TTAAlAATCr GAATTITAAT AGCIATGEAT GIACTCATAT AiTil^lATCC 
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9451 TTTICTAATC TGCTTAOCTC TAAGAGAAAA AAACXTCCTT TTCCrmTTA 
9501 ATTATACATA OCATTAAAAT GAATTAGGAA OITAGAGATC ACIG?^TGAAT 
9551 AGAAATA3GA AAAACITOOC CCAATCDCAC AGTCATAGAT CATCITCATG 
9601 AGAGAAGAAT OTTOCACTTr TIAAAATGAG QQCXTPGATTr TAG3CrTATA 
9651 AACACITAGC AGATt^TTT '^C^ICAGAACA ATTAAATCAC TAAAIATG^T 

9701 oGGCTiaiGrr nCTJiGiur aagtagcxx^v (^ciogatta AGcrrrcicr 

9751 CrTAATTTAT AGGAAGTGAC ACAGTATnT AAAQGnTIA CrdTAGIAT 

9801 TTTCIGOCAG AGAAAGTACA TOnTAGAAT ACAOOGAATG CrCATTATTT 

9851 TTCCAOaGAA GAAAATTATA TAATUIGAAT TACATTATTC CTIAAAAAGA 

9901 GrTAAGITCA TAAGGCATAT i3GAAAAATAT AGGAAXAATT CATIGGTIAG 

9951 ACAGTrCTOG CAAACATACT CTATGGAAAA TAAGAGTGCA AGAIAGCEAC 
10001 AGOOGITATA AAATTIATAA TrcATOGrOC AAATGTACAT TTGIAGTATT 
10051 GATiTCATiG g:wvttaoc:a AOOGATTAGA TCAATIGIGG GGAAAGIGTA 
10101 rri'iTiAAAA ATAAACAAA3 ATAAAGATTT TmTCTGAA TrOCAQGEAA 
10151 AAGGCAGCAT TGCTXCTOCA TnATEAOGT AGATGCITCT ATCAACATTC 
10201 TTATTITTGr GCTCGAAATC TTGGATTiaG AAAAATAOCA ATOGGEATAA 
10251 ACATAAAGAA A'XATACATG CATGrOGOGA TOCTAACAGC AGAAATGACT 
10301 CTGAATGGAA AAAAAAAAAA AAAAAAAAAA GGGAATTITC GIGCOCCATC 
10351 CTTAGCnTC T^TIGCnTCr CTATTATATA TGCAACIGCC TSOOGCICTA 
10401 TUITACAAAG TACrrOGIAA TUIAAT3CAC AGGATCAGCA GTAATGCAGC 
10451 TCAGACIGCA TGCITIOGOC TnGGATTOC TAGATTTGAG ATTAAGGm 
10501 AGrGAOGriA riGAATAGCC (ZTTGAATTCr AAGIGCTGAT GIGAATATGA 
10551 TGCAAATATG ATGTACATAT TQOGATGIGC TGAGTAAGTA GATGIAGCAT 
10601 TIGCIAATGr TGCTATAGAT TTAGCATCIA AGTEAIGAAC CACMTCTAC 
10651 GACIGOGEAA GATTAAAAAA AAGITAGGGA CTTCAGGIAT GEAAAAIAIA 
10701 GCAAATICIA nTCTAOGAC TnAAAGOGT ATGIGIAGAG TTCTGAAAAG 
10751 AATITGICAG GCIOOOXAA ATOGACATAC TmOGAAAG CIGATCATIG 
10801 AAAAGATTAA TiGIGATOCIT TATTGEAACA TCTAACATAA TTACATTTTA 
10851 TTIAi'lUiAG AAAL'iTiATT ACCIACIUIC TCTTOCCTIT GCAC5\ATCAT 
10901 GCTGCTIGCr GIGGCAATAT TATGACTGIA TTGAGITCAC Ci'l'l'lATEAA 
10951 AAACAGOGAA OGAAOGAGGr ATGCmXAC TTGAGTXXAA GACATTCIAT 
11001 TITAATICrC ATAAAAGAGT ATTTCAGrCr GTIGCITGAT AACCITAaGA 
11051 TGATTATAGT GAGITICAGA TITCATITIC TTCTGAGCXC AGKS^GACGA 
11101 TGICTGAGIG TrmTAGTIG TTIGOGCAAG TGAGAGGCAG GAGIGAAAGT 
11151 GAACIGGCrC AGGITGAAGA <:AAATAGAAA AAAGAAATTT CIGATATAIG 
11201 ATAGAAATAA CIGi'i'riGAC TIGCTACATG CAGCIAAAAT AAATAAAACC 
11251 ATIt^TTCrr GYTSXXS&A CATITIGATA TATIGCITAT TGGTnTIGA 
11301 GGTIGCATCr TTIGQoCTIA TAATTTCTAT ATCAIGITTA TTEAGAIGIT 
11351 TGAGACTCCA OCAIGS^TT ATAIGAGAAA AATATTTEAG TCATIAAAAC 
11401 AATCIUITIA AGAAGQCEAT TTIATCmG ArrGEAGGGT CmGATTEA 
11451 TGAAAAATIA GGAGAAAAQG CATnOGATG GCaaOGAAAA A1TGGAGCIT 
11501 TIGTITCCAT TAGAAIGGAG AACATTOGAG GLAAGGQGAT ATACITIOCA 
11551 AIGGATOCCA TAAACnTCT ATAGOGKJIT CAATAAATAA GAAAACITAT 
11601 GGGAATAAAC AGGCACri'IA GATACAGAAA AATIGCEACT TATAGirCIT 
11651 AAATTITAAA ATGATAGTIT CnAAAXAOG TnGIGTCCT GCTITAATTA 
11701 AAAACAGGAA TATCTAAGAA TGAAATAAGA TATAAAAOOC TGCCAATIGA 
11751 ATTCTAGAAT TAAAATATAA AATAAAAGCT TTCTIGATTT TTAAK?nAT 
11801 TATAGGAIGA ATTATEACIC TXAAAAATIG AAGAAi'l'iGi' GCITATATCr 
11851 GTCATIGAGA AAAGAGTIGA OGnTICEAT GIGIGACIGA GnOGATITA 
11901 CIAAACIGAA AAGIOBGIGr CIGGGGGAAC ATAGOGAAAT GCIGIGGTOC 
11951 TTGAAAOoCA OOITGCACIG AGCXAGOaCA CIAGAGAGIG TCIUIGGAAG 
12001 TITACIAAGG CAAAAGTCIG GCIAGGGATC AAATOGACEA TAAAOOCOSG 
12051 TTIGTIGATr CEAIGGATIC TrAEAATIOC CACIGAATTA TCAITTOGAG 
12101 TGIAGGAOCT AGAAATATAT ATATATATIT TEAAGAATGT TCICTOGTTG 
12151 GIGIGTITGC OGACCAGCIT GATACIGTIT CIGITGIGrC TTTOGGQCTC 
12201 A3AAGGGATC CAAA0GCA1A TTICAGATGr OGrGGOaGCI GCirOCTOGC 
12251 ACATGGOOa: AGOGATGTOC CCAGATAATG ACACTIACIC OGICACCTQC 
12301 TAOOCAGTOC CEAAAOCIGC TATTCIATTr CTCIGATCIT TCmTCKA 
12351 GIGAATAOCA CCPOCPOVCA TOGAGTITCr GAGGGCAGAA ATCIOGATGr 
12401 CAGOGTAAAT GrrTOCmT OQOCAACTCr GGAIGTCGAA TGAAATGGCA 
12451 AAGIUIGITC ATTIGATCIC TTACrmTCT CTIGAACCrC TOTICTCIGr 
12501 COGIOGIGAT GAOGAGAGAT GATGAOGATT TATAGCIGAG ACTATIGCAG 
12551 TAGICTIUIA ACIGGIdTC CIGGCTIGAG TrrOOOCIGC TGTGAGATAA 
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"""^^AO^^^^^^^ ACICTAATTr GnUTGCAG?^ TAAACTITCr CAAATTIGAG TCIGnTCTA 
" * 12651 ClTilGiarr GGATAAAATT CTTCPOCATG CCITmTTAT TTICAAaGAA 

12701 AAACITAAAC TCATraGACT G?VG?^CAAGAT CTTCCTCrAG TTCITCrocr 

12751 CAATcnrcr AAACirrccr aggpatccoc atatctatut ATcrriATcr 

12801 ATCEATCTAT CTATCTATCT ATCTAmiAT CTATCTATUT ATCAIUTATC 

12851 AATTIATOCA TCATCTATAC CCEACATGIC CIGIUTCAAA CXTVEAACAAA ^ 
12901 TTAlAnTAT TCOOCTAAGA GIACTATITr AATATTnTA AAAATCATQC 

12951 ATCGcnurr nxACAGQcr Acmcrooc citgacictc tctgaaagtc 

13001 CrCCAAOOCT AACAGAGACE CACACACACA CACAGACACA CACACACACA 
13051 CACACAGATT TTCICIUTCA CTCTCCTGAG CraCTGTATT GCIXXTGIAG 
13101 ACia?IAAAT ACIAGITCGr CroOGCrCTC ATOGIOCIGT TIGIATCIAG 
13151 TAIGTIACIG TnTCIAAAG GATATTTTAA AAGAGIT3V3 TAC^^GAATAA 
13201 GCrmOGAG TCIGATGGAC CIGAATTT:^ GTCIGITIUr GTGACIATGT 
13251 GrGAACnOG GAAGATGACT GTACTOGnT GTCK^TTIT TTGATGIATA 
13301 AAAATIACCr TAGAAAGGGT ATIGK^^GGA TGAAATAAGG TAACATATOG 
13351 GACATAATAA GIUITCimA TATGCTTCIG TGCrOGCIQG TIUIUIGGIT 
13401 OGATATOGAT GTCTCIGGA^ TIGOCIGAAT TAl'l'lTl'lAA ATAOGGATTT 
13451 AAAAAATIAT AAAAGAAATA TATGATGATT GIGAAAAACT AAAACACIGC 
13501 ATAAATATAT AAATIAGGAA GAAAALTl'l'lA TGTCAGrCAT CCICAGAAAT 
13551 AACIACrCAT AQGrnTCOG CTATGOGIAA TTCAAGAAAT AGATTGAATA 
13601 TTGTIACICAT TGGATGATCT TATGATAOOG ATTnGAGCT TTL'l'i'i'i'iAA 
13651 ATITAACAAT ATGCCTIGAA TATATTTGGA TGrXATTGIT TTIAATCATT 
13701 TTIGAGGriT CCATTAGAGA AATGIGGGAT AATnGITIA GAGEATOGIT 
13751 ATTGATCAAC AC?rraGATIG TTICIAATTr TTGAGIOriA TAAAAATGCT 
13801 AGAGTAAATA GAGITOGAGA GAGATGTIGC AAAGAGGCAA CCGAlTl'iAA 
13851 TAAATAAATT CACIGGAC7IT ATCAAOGATT TGIGGAATGC AGAAATTIUr 
13901 TTAGIAATCT ATGIAACTAT ACTCAGGCIG ATAATQGATA GTIGGIAAGC 
13951 AGATAAGIAA AATTGAGOCA TATCnATGA TTIGIGnAA AAAAATTTTT 
14001 ATATGITAAG ACEAGAATGT TGOCTAGAAT TIGAGA£TCAA TATGAAAATT 
14051 GICICATrCA TnTACTOGT TrGGAOCGAT AIGGAIATTA GOOGGGCAAA 
14101 TCGGAACAAA TAGAOCACTT TAGATnGTT TGAAACTGIC AGOGITATCA 
14151 AGGITIAAAG TATOGAGCAT TTGATAGGAT TGOCITATAG TTQCTCIAAT 
14201 TEAACAACTG AAATAAOCAG GCATAAGCAT AATTAAOGCr OGACTGAAGA 
14251 AGTIGAGIGG GAGGAOCTGA aCIGIGGnC AAAGGATAGG GACTACrAOG 
14301 GTTCIAAACA ATOGAATAAA GXATAAAGOG GICICTCAGr CAAGOCTCAC 
14351 ACAOGIAAGA GGOGIGACIT TAAGGGAGEA AGATGAAATA TOGEAAGATC 
14401 AOGOGAGAAA TAATGCICTC ACITTi3GriA CnTATTIGA TrnTTIGATA 
14451 TnOGCATAA GAGAAATGAC TTCrATTIUr GTATTIAAGA ACICIAGAIT 
14501 TAGAAGACrr AATmCTGA ATOGGCTAAA AAATTAAGAT TIACIGCAGA 
14551 TGmrCACA TTAACAGATT AATGTGIGGA TGATTGIGAA ITITIGAAGA 
14601 CGAAAGAKTT TAACATGACT GACATCACIG AAAACCAGGA ATTAATAGCT 
14651 GIAACATIGA ATQGIACCTC ACCAAGOCAG CIAATCAGAA ATATGTCGIG 
14701 TGTTCACACr CIGIAAGATT TAGCi'i'iAGC GAAGGICnT GCAAAGATIA 
14751 ACCAAAIAAT GICTACAGAA GGTAGATCOG CTATTGIAAA AATCATITCA 
14801 CITTGACAGr ACAGAAGAAG GAOGAOCOCT TCIGnTIAG ATGIAGrOOG 
14851 TCCmTGAA aCIGTATGAT TGIGGAGATG TGAACTIAAC ATCTCOGAGT 
14901 TnTATATCT TCATCAGT3G AATGAGAATA ACAACATATA TCnGTGATC 
14951 TGACAGOGTr TTTGAi3Vn5^ TGAAATGAAG TAATGIGCAG AAGEAAOGAA 
15001 TGrGOOGAAT TATTATCATG ACitJi'lACIT TCATATCAAG TGAAGAAAAT 
15051 ATrnTAAAC TGAGEAGTIT AATTIAGAAT TTAAGIATGr Gl'lTlAAACT 
15101 QOGIGnAGC AAAAATTCAG TAGAAOGATG lAGGACACAC TTAAAGmT 
15151 GATGTAAAAT TIGIGAGITG TATTITEAAC TGAATCmT OGOGAlCTGr 
15201 CAAGAAATTA AOGTIATCCr TCAOCAAATG GGIGGGCTIG AAAAAOGGGT 
15251 C5VIGGA1AAA TATTTAGAGr TGIAGQCAAA AnGEAATCT TATGIATATC 
15301 AATACATATT GATTmTCA GGGAGAAOQC TTGIAGATTr GATGAAGAAA 
15351 TCnrCACAA GAGIAGATAA TCAITGATGT ATCACTIACC TAC^nGCTCA 
15401 TGAAATTTTG OCACnTATA TAAITOGriA GTIAGGGAAA AGGAGAGTAA 
15451 GATGAAGAOG GGO^AAAAAA AAAACITdT TGACAAAGAT GGAGAGAAGC 
15501 TCTGATCTCr TGIATIUnT TATGAATCGA GGAAGOCnT GGmTGAGA 
15551 AEAAGTOGTC TGAGAGITIG TGIACKrTC AGATAGGIGC 03GAGGACIA 
15601 CMTOGTGOC GATCTGCAGA AAACGAGAOG OGATATATTG ACIUIGGAGA 
15651 TCIGCOCnr GATTCTGCCA TCICTGAGGT OGCCCATGOC TiTHJi'lGCC 
15701 AGACEACIGC CGAAGITATA GAGACIAAGA GAGGGAGACT GAGEATOOGC 
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15751 TATGriG?VTT TATPACTAAT GAOGGGAGAA OnTAGAACr GGAGCTKAC 
15801 TOTAAACnT OG^iGCMGAT TEAAGACAGA. ATCAGGOCIG ATACIGmA 
15851 CAAAGCTGCA OCIGAAAG?^ CP3GAAGCTC AAATGTCEAT CITOGAAGAG 



15951 ATTAGATGTT CAIUITACAT ATCCCAAATG TCATAACITG CTTOCATGIG 
16001 ACirGAGIAC TCTOCACAOC ATTAAGCICT CACATTnOC AinTAGCAA 
16051 TGTCAAGCTA CXnCTITATC ATEAAATAT3 AACTAOCIGA AGEAATCAGA 
16101 aCATTCAT3G GACTIGAAGA AAATACraOG TATGTCnAT GCTOCCIUTG 
16151 TT3ACATCAAG TCACICATTC TACTTGJTCT TTTCIGATTC TAATATCOCT 
16201 GTCTCTCACT TUEAGAGAAT OGEAOCTGAA TaTAACEAC CTCATCATAT 
16251 TIUIGTCIGT T3GAATrATT CITCCACriT aXTOCITAA AAAnTAGGr 
16301 AAAGATATTT TCTAACIQGA AATATnTEA I'i'i'i'iATTTC ACATTTAAAT 
16351 AaTTTAGCrA ATIGIAGATG OCATATTGAC CITCCAAAAT GCndTCTA 
16401 ACnUTAaTT TATCnOGCr ATACCAGIOS ATnTCICIT ACXTLGCATOG 

16451 iGrrmrcr tagiuiqcjia agigatgiga tgagaigatc cnGGAOGiT 

16501 GGTIAGCATG AGrri'l'lTlG TGCUTAAATT AGICTGCICA TITIGriCAA 
16551 GGACITCACr AATATGAAAT AGITCriCTA TGACAAGIGA TmCTIGTA 
16601 C^CTAATITA GAGCAAAAAA AGAGCAGCTA OGATITAAAG ATAGTIGAGG 
16651 TAGAATATCA AAGCTACTAC TAATGGITIG GTCEAGGCAC ACIGGITATA 
16701 TATQOGGAAA AAAGGAAAAC TTGAAGGAOG AACATGAGAA TAAICIGGCA 
16751 TTTAGAACAG CAGAGGAGAG TCOCAGATGA GAAACAAGAA aGCEATATOG 
16801 ATATTGACAT GAATCAGCGA TTCrcrCITA GACATTGGAC OCATTAAGAG 
16851 AGGAGAAGAA GAGIGOGATT AAAGAAGAAA TCCTOCTCrC TAGOGCmG 
16901 AGAAAAGAGG GAAlTiLTiG CACIATCAT3 AAT30CAAAA TITATAAAGC 
16951 ATTTCCaCAA AGAGGEAAAG GAGAAGGAAA AAAAGTITTG AAGAOOGAIG 
17001 TCAOGITAGr TTGAAGAAAT AAOGAAATGA TGATCITICr CATOGAAGOG 
17051 CATGAAAGAG GGIGGGAAGG ATIOTGCAA AATATIGTOC TGTTAACICT 
17101 AAGAGGCAGG GCIGCGAATC AGAGCIGGAA CTCnGCCIT AGAAGAGAGG 
17151 CTAGAGGAAG TTEAGnTGT CCATTAGTCr AAAAGGAATC OGEAACIGAG 
17201 TiaXTTCACC OOaGAOCCEA TAAGCGAGAC ATATOGATTC TTATITGAIT 

17251 Gmrncrc aaaaagciga Tmrrmc TmriAATC acigagtcia 

17301 GGKSVrriAC AAS^AATTOC AAATAOCXnG aOCrCTAOCT Gi'i'i'iGGATC 
17351 AGAGIGTIGG AAATGIGTCA TTCAACAACA OGCITCCAAT GCAIGIGGIA 
17401 ATGITACaGA ACAACICTGA GAGTIUIGAT GIGAACITGA TGAIGGATEA 
17451 GAOGGAOCQC AAIGCIGCAG GGCIGGATGA GAAOCAGGOC AAGaGCTCTC 
17501 TTCAIGACAG TGS^GEAGAA TATGAAGCIC ATAGIGATGA GAAGIGKS^ 
17551 OOCAAATACT TTGIATTCAA CTGOOGGGEA AGIGAGOGGT Ca33GCriCr 
17601 AATGAGTACA GITATGIGIT TTCTAAGnT TEATTGAATA AAGIGAGAIG 
17651 GCGIGAGATC AOGATGTATG TTOGAATCCT AAAGAOGTOG TGnGTGnT 
17701 GmTTGAGA OaOOGEATGC AATTOGEATC GTAGmTTIG CirTTGIATG 
17751 OCAXCIGAG GrOCriGOGA TdAGAGIGA ACITAAAGAG TAAOGGAGOC 
17801 ATGAmTAG GATTCIAATT TGGTnGAAA TICIGGIGAT ATGITGAAAG 
17851 ATICnTAAC AGGAAAGACA Gi'l'lATAGCT TCCTCITGAG AGAAAATAIG 
17901 TAGIGGATCC ACIGCIGAGr AAGATGCTIT AATGAGAAAG GIGGGAATCA 
17951 GOGGACCAGA GCAGEAOGIT ATGnCITIC TCrCGITICr GTOCACCATA 
18001 AIGGTIGAGG GGAGGaGriC ATGGCN3JTG GACAAGGAGT OGAIOGriGr 
1B051 AATAATTTIG GCAGGIGTIG GGAATITAAA TTIGAATITT GITCGGAAGA 
18101 AATCATGTGA GCIGGAGIAG AAAIGAAAAC AOCGAIGAOG AGGAAAAGTT 
18151 AIGGTEAGGG GCAGGGTOGA TAAGOGAGIG ATGTGAITEA TAGICAGGAC 
18201 GIAAGCGTIG TGEAGAAGAC ATIGATTAGA AGAGAIGIGT GAAEATGIGT 
18251 OGlTiGi'lGT GITOlTiGiA CAATAGAGIC AGIGGGIW3A AAAIdTGIT 
18301 TGTTOGAGCr GATGGICIAT QGITGATTIG TATTUITITC CGITIGAAGr 
18351 TGTIGATATr IGGrXGOGAA CAAAGGATAT GAAGrCATIA TAGGIGmT 

18401 oricrriGGr teaagggag^ atateatata ataatecica ageegeeeaa 

18451 TGEAGACATC AGIAAOGEGA GKTTCATTC TCAGEAAAEA GCAAAAGEEE 
18501 GOCCATAAAT TGIGATTTAC GTGATAAAAA ATITGAGAAC ACEITCAAGE 
18551 ATEEEGATGE GEEEGATEEA GFEIGAAAAT lACATGEAGC AGTEAGEOCA 
18601 GAAGGCrGAC AATTGAIXTEE T3GCAG0CAG GTKXTrCTA GAAIGGETEE 
18651 CAGAAGGTEE TGAGGEAGIG TGGAGTGGIG GCAGEAGIAC TITGGEGAGE 
18701 GEAGEAGGTE GITEEOGrCA TEEAAAGEGA TGEGATEATC AAAIGGAAAA 
18751 QGEEEGEATC TE?y3GAG0GE GEEIOVICEE TAIGEEAATE ATATIGEEAE 
18801 TGAGEGGGGA AGGEEAGIGA GGEAGGIGAA ATAGAGEGTE 
18851 OGAAATGATE GETETEAAGA GIGAAOGACE AGEGEEEAAG AAAAATGGAA 



15901 



AAGEEGGAAG GAGEGGCAAA TAGACAATGA 



GATErGQGGG 
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.8901 ATGAATCrrC ATTAOCrCIC TAAGAGAAAT TTAAATCAGC TATAACTnTA 
18951 TOIACIAAAT ATGrCITCAT GATTAGCAAT ATAGATATAC TTmTATTA 
19001 TTATTITCAT TTTGAAAA:?! GAi'iTl'i'i'l'i' IGIA^GTITA AAAAACAAAG 
19051 CTT'raGriC TTTUrmrC CNJmnrCC OaGAGAAAAA TGGAAAOaCT 
19101 GrCAAAT?^TT TCCATCAOGG GGATGCrXCT CATGIACXnO CrXQCOGOaC 
19151 TCnTGGTIA CdAACmr TATQCTEAGGT CACTCIGAAA GICATTCIUT 
19201 ATATGCAAAT CCl'IUl'iAOG CraGTOCTIG ACCraGGIAG CTATGAinT 
19251 T?y\AAATIT3C CnCTATAAG CATa:T::TAT AGATGACACA TATTCAATTA 
19301 ATATACIATT TIAGTITIGT CACTraACCT GAOGAAATGG OGOOGATTC 
19351 AGOZraGCTA ACAAsTCTACA AGAATITCIG AATTAACAOC TATTTIATAA 
19401 AAAATATOQC TCAAACAAAA TTATTITOCr CTAGOGATAG ATGATATTTC 
19451 TOT3GCIAGA CTOCATAGK: CAACKAQGC TAGAA£:?IGAT GAGAATGAAT 
19501 CGA'nTGCAT GIGATAAAGC TCCITIGATG GAATEATTAA CIOXACACA 
19551 AATAGGAGQG AAACTGCCAG GTOCrCAAGT TTGAATTiaC CTCCTCmA 
19601 CCAGTCAAGr GAAATCrOOG AGCTraOGAC TTrAGCTAAA ATTICIGACA 
19651 TATCCCATTC TATTTIGnA TACTAAAIGA TITOCTAAGA AAGAGGACAT 
19701 GAGAGAATTT GCITCAATCr AAGAATGGAC GACXIAAAAAA AACT3ACIAT 
19751 OGOCACAriA GATTATGCTT GCAACATTIC CrcTCTOGGA TCITAACAGr 
19801 TCACAAAGQ3 AGEAGGATIG TACIGCITOC ATGAAGTCTG GCGAGATAAA 
19851 CAGATITGAT GGAATGACAT ATTGAarrOG TAGGATATGT TTACATGAAT 
19901 GAGIGIATCA ATATAAATAT ATTmGIAT AAACXTTCCTT TEAAAGmT 
19951 TAAdTAATT TITITCITAC TGACTIGGIA AATIGAATIG CATGIATGAC 
20001 AAATIGIGGA GGAAAAGATT CAOGAGTAGG OGACmTTIG CITAGGl'lTi' 
20051 TTTTCIATTr GACTAATATr TGACTATTAA OGAAACATGr GCnTAGATT 
20101 QOGCATTAAC TnTIGCa3G TTGIGAAAIA AIGAATGAOG AGGTGAATAC 
20151 TACIGAAGGT ATTITGACTA Ci'lTnGTCr GATCTrGAGG TGAAAATC3CA 
20201 ACrAa3CnG ATICCATAGA TAiTi'iUi'iG TmTTIGIGC nGGAGTOCr 
20251 GAATGAAGGT Gi'i'i'lCAAGT AGGGCIGCAT CrrOGrCITA GAGEAGTACC 
20301 GACIGGGAGA CCATCTAAAA ATEATACTAA TTTATOXTG GAOGITACTr 
20351 ATACITAITr TAATGAGTIT GATAAGACAA GCAAAAACTr GAAAGAGCX3C 
20401 AAAAATATCT Gi'lTiAGIGT GGIGATGGAG TGATAGTIGT TGAGCITGAA 
20451 AAAAiaGTAG CAATGATTCA TOCTAGAGIT TACAGACIGG GITIGIAACX: 
20501 TGGATGAGGA GIOGCIGGAC AGGrAaGGAC AGGGGAGGIG GrAGaCIGQG 
20551 AGAGACAATA TGIGOaGCIT QGGrCICrCA TOGCCITGAA GAAGAGCACr 
20601 TTGGTCICIG TCIGATTIGr AATIGCITCr GEACAGOOGA GAIAGATITA 
20651 TCACAATGIA AATGAGCTIG AGAGGCTCTT TAi'lTiGlAT TAIAOnTCr 
20701 GCAAOGTIAT CAGCITCAaG ACCrcrnGT TGATTIGAAT GAAGGTIGGA 
20751 TAGCIAATGA GCIGAGAGGC AAGAOCAGAG GIGCXIDaGAT TCOGAGGOCT 
20801 AGGICrnTC CICIGTTCIG TGITCrCTCr ATAAAATGIT GCGATAAGIG 
20851 AOGIGIGCIG ATTIGAGAAC AOCAAGOGGT TTCATTCICT TTITaCIGIT 
20901 GTAGGAGAAG TIGAAGATGA ATTACITGAT GCCIACAGGA AAGIGEAXAC 
20951 ATTAGACATC CCTCnCTGA TOJnXGGCr aGGAGTCCIT GIGGCAGEAA 
21001 GACTAACIGT GCaCATIGTC CICITaCGAG TAAGEACATA AGACITIGAT 
21051 GAAAGAAACC TACrTGAOOC GATAAATEAG TACATGIGIT CIADCTIGAT 
21101 TTIGAinAA TTAIAGGGIG AGITIGCAAT TGCAAIGCCT GAGGATATTA 
21151 TTITOCrATA GCAnTIGAG TGACTIAAAA TIGGOCATTr AAIGIGTAGA 
21201 TAGAGGAAGT AGirKAGGr GGIATTmA TAGIGIAGGA AAAAAATCAT 
21251 AAAACmrr TTTAAACTGA AAGTIGAAAA GIGGAGCIGG AGCnCIGTC 
21301 TTGIGGATTA GTAAAACIGA GTAGGAGTIC ATATAAdTT GGAAOCnGA 
21351 AAGCCAAAAC CATATTAACT TTGAAATCIT ATTAAATTrC ATGACAGTIT 
21401 TGAAGGGATT TCALL'ri'i'i'i'i' TCGAGITIGr TGIQCIGCAA TAATATACAA 
21451 AAGTIGCCTT mTAACXTTG AIGCCTIGAA GGCTAATGAA AAOGGGATrC 
21501 AIGrrAAGIA AATTAIATAC CAGAAAAAAA TiTi'iCAAAA AAGfiGTIATG 
21551 CIATCIATCA CATATCTCTC TCACACATGG OGTCIGCGAG ACIGACAOCA 
21601 GGIGAOOCCT CCCIGGCATT TGrCATIGGT GIGAGITIGr TCIGAGATOC 
21651 GAGA3CAGAG CTGGIAGIGA AGATTIGQGC TGIGIGAGIT AAAACCAOCA 
21701 QdAAGGATA AAGAGAGGIC TIGACOCTOC TGCXAGCTOC TGnTCATAA 
21751 ACACIGAA1T TACTCAITGA TnGAQOGGG AAAAAAATAA GIGACAGAGT 
21801 AAOCAGCACr GTCOGGACA TAAIGITOCA TACAGaGCIG GCATATGAAG 
21851 ACIAITICIA TAATGACACT GTGGICACIT TAAAIGCAGC TTGIGIGCIG 
21901 AAATATATTT TGGCACATTC Cl'iTi'iCATG AGIGCATGAA ATCAGATOOG 
21951 TACEACIATG GIGGdAATA TTTTACrCIT AAATCATGTC TIGC?CIU1AA 
22001 TATATCIGAA AGrATTTGAG ATGACATACA CATAGCITIA GOCIAAAATC 
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51 PCCTCOJTCr TGJJVPCAPG AGAGAAG?^CA ACIATAAAGA GAAQC^rATAC 
i^jQP^M^^iiilOl GATN33JTAA AATTGCCAQS GAAACAACIT CACICAGAAA AOGATATCTG 

22151 c^GOxnur TmATGiar aaaaaaatca ctgaciaaat ttiqgcacag 

22201 iriTAAGCATT GACATCATIG TAGAAT2AAA OCATAAGAAA TCIGIGATXTT 
22251 QCnUIGTAT TCCITEATTC ATATTOVTAT AGIUi'lTlCA AQ0CATG:JIT 
22301 TTAAGOGATT GCCAGAATTG GCGATOJrCA GAGAGACAGC TOCTXAAGAOT 
22351 TTCAACTAGIG GAGCTCATAG CCCAAGACIG AGOGCTGCAA TTATIUTCAT 
22401 OGGAAGEAAA ACTCATTIAC TGATGAAGAT TTGACXTCAG GATGGAAAAT 
22451 OGAAATUTGC CCTTAGAAAT TCITAOOCTA TGIIGAGAAAT AAA3CACIGA 
22501 TATAAATCIG ACCATCAOGA ACAGCAATAG TOTGTAAACA TIAGATOXA 
22551 TTACi^AOCAA AATTGAOGAT AAGAAOCAGA GITCAGAAAA ATGACTAACT 
22G01 GCTCTOCrTC ATXAmTATT TOCACICAAC ATTAGCATTT ATGAAACATT 
22651 TTGCACATrA TCCIUIOCrC AC(XriGCAA TrrrTAGATIT ATATAATZCG 
22701 TCTAAGIGCr CCACIGCXXX: ACAGAGTGAT AAlTraCCTOG GACITGCJIGA 
22751 TGIGCACAGT GACTGGCAGA GAOOCTGAGC TUIUKGTGC Tro3GAAGAA 
22801 AAAIGCTCIT CAAAT3AATC TTGOCTIGrC TTGAAATOTA TAAACIG2CT 
22851 TrrCTAGCAA AAGCATAGAC ACIUTTTOX TIQGIGACAT GIOTAaWV 
22901 TTCSGCIGGG TTGAGGATCr QGGCTAAATG AAOCAAAOCT CCCIATACAT 
22951 CMOGATACA CAGAGATOGT GACAGAGAGT a^TGACITCC GrGAGTGGAT 
23001 CTGAATGAAG TOCTCIGAAG CTAAATTGAA 'I'i'i'i'i'iTiCr TTACTAAAAT 
23051 GATAAAACTT GITATiaaCG CmTGCTIG TTTATTTair ATAACITAGG 
23101 QCTGAGATTr TCAAIGIGTC AAATGCTGAC TCACAGGATC GTICrCXriGA 
23151 CAGITIATrr GATnAAOGA ACIdTGAOC AGTAAGnTA TTEACTIGCC 
23201 TTGATATUrC GACACATEAA TAATAAAACT AACAAAAOCT AATCIGAATT 
23251 AAAATCTATC AGCnTAGGC ATEATTITTr GITCrOCTrC TTTCAAGATG 
23301 GTAACraOGC TCIUmCi'i' AOGAaCriGA GAAGATATCA CIGGGGrnG 
23351 'i'l'i'i'lCICIA CITCATTEAT TATCmUIT TTITaCAATC AOGriAGnT 
23401 TTTCCi'i'i-i'i' AGEAAAAGGT GCATAGIAAC TOITIGrAGr ATTIGrXGAA 
23451 GAAGIGAATA AAIGAAATGA ATTAAO^rAG TGmTGACT AGGAGCQCAA 
23501 GATTICnTC TCTCnAGIA araOGTOGGG TATCAGTIAT OGAATOGCAC 
23551 CIOGITOCAG AGGACIGATC ATGTGATnT GAGCITATGC TTOOCmAT 
23601 GGAGTAAAGT TTOGATATTT CCATAAAGAA GAAGAAAOCA AATAATCCTA 
23651 ATGGATATAT AATGAAGAGA CAGATGAAAA TTTGACCIGC GATGCX:TnG 
23701 AAAAAAGATC CCEAOCTACr TGEATITCAT CITATAATTA AAATCAGICT 
23751 TITGACrrAT Gl'i'i'iL'l'iCA GATCraCIGT TTTGAAGIGT ATATAGATAT 
23801 GAACATAGAA ATGGAOOGIA TATIGCEATC AACIGGAGIG GAGGAGKSVT 
23851 TaTCAQGrrr TOCAACATCC TTQCCrrAAG GAAACCIGCA AAATCAAAGT 
23901 GIGAGCIA03 TCIAAACAAT GGGAGAGGCr TTmTmT TTnAAGAGT 
23951 TAGAACTAAG ACTUTCACTr OCT2CIGP3C CTQCAGATTr TIGADCITCA 
24001 GATIGOGOGC CIGCATCAGA AXACAGCAOC OCCIAACAaG CTOinGriCA 
24051 GGACTCrrrC TCIGGAAATA AGAGATGTIG TUTCEAGAQC TGGAIAGAAC 
24101 CTEAATOGAA TCAnCTGOG TGAGAGGCOC TOGATGGIGC TOGGGACCTC 
24151 OCTGACXrAC AGGATCTGAC OCACATITOC AGGITCCIAG OGACnGIGT 
24201 GAGEAAAGAA AAAGGCAGAT AGCTAAGraG AAGAGGAGAT GAGGCnaJT 
24251 GGGAATCAGC CAGIGGICIG COCTAGCAAA GGTAAACAGA ACIGCIGQGG 
24301 GCmTGGIC CIAGQCTGAC TACrGAGG:5V OGGACTnAA CAIGGAATGA 
24351 OGAGGAAGIT TOdTOGrGA TCi'i'l'iOCAC GACCAOCACA AGOGTAITEAC 

24401 cixxcToccr cmGcrciG TTOcrcrcrr ggggaatgca ciggaaaoga 

24451 OCrrCAGITC TGTnOGAAT TTTCGmTTC CTTATTCAGA AAGAGGAAGA 
24501 AGCnriGGA TnACICCAA GOGITCIAOC TATTAnOOC ATAAACTITC 
24551 TGKSVTCrCA TATGATEAGG GCAAAlGTm ATCTTTUIGG GAGOGAGGAG 
24601 ACIGCTITGA GATTGAGAOG CCCTG^CNT ATAGCSOGC CTGIAACrGA 
24651 CICmACTCA GCTTATIGAC TIGAATOCAC CITTTEAACA AGIGACIAAA 
24701 AAAGAAACIG TGACIAITCr CIGAAAATGA GCCIATATUr GAlAdTAIT 
24751 TATTCTGITT AACACIGIGA AACAAATTAA GrOCTCIGGC ACmTGIArA 
24801 TACGATAAAA AGCITATTIG TAMOCTACT AATIGGAOGA GTlTiGACAA 
24851 TATIGAATAA GGACIAATIG CAGATGATAA TOIAGAATTA TAGGCIGCTG 
24901 AGGAAAACAA TATGACAOGA TTIGCmOC TCAGrTTOGr TTKAGAATG 
24951 AGinCATAA TGITCACTAA TOGAA'l'X'lTi' AAAATXXTIT AGAAAGTEAT 
25001 TdTAAACTA TTIGCAGACS^ CEATCIOGIT TTTGAITCrA GAAAIGAAAT 
25051 TGCCmTCA aCGTAAACAG AIGGCCITAA TTITIGGIGG AGIGGTATGA 
25101 AAOGAATGrC ACATGAGAAA CIGCAAGCEA TTTAGCTIGA AiTril'lGTC 
25151 ATTGATAGAT GTITGAAAAT ATATnTAGA TnTCTCIUT TTEAAATGAG 
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:)1 TTCCTATCTC TGCAOCTIAA GK^^CITCAG AACEAAAATT TTAAACTC^ 
£251 CATCAATCAC AGGATITCCA AAAATGIX^ CrCCIAGCIT AAGOSAAGTA 
253 01 TTCACITATT GGAAAGCTGA TAiS^fjEAATT OCACTAAGTC CAAAAAC7ICT 
25351 OCTCIAAAAG ATTCCAAAGA TAAGAGIGIT TTCAACTTTG TCAAGCIGTA 
25401 CAAACAGAAA TX^TCACTCOC TCOCrCTOaC GACAGOGATC TTEATOCAGr 
25451 TACAGCAGOG TAACTIGAGC AGCIGCIGCA AACTGAGGCT CrCITGAOCC 
25501 TICGCCIACr TATTTGAGCr GCTAAAATAG OaCTGAAATC TCTCAAGGAT 
25551 OCrCAAGOGA AGGATAAGAT TCCTACTATT GAATTTAATT TAAGCi'i'i'iA 
25601 TTGAGIGCCT GCIGIGTGGA GAACACTAAG CITO^AAGTC TGAGGAATCT 
25651 TTAGATEATT AGCTCCIGIT OrriGOCrTT GATAGAITIA CAATCTATTG 
25701 ATAOGGAGAG CIAAAAAGC^ GAGAAAGAGG AAGGAGGAAA CATAAAAAOG 
25751 TCAAAATTTT AAAATACCAT TTLAAAATTT TATTTTAAAA TGITAAATAC 
25801 GATQGAAAAT TAAGGAAAAC CTAGATTCAT AAAAATTGCT TTCAGAATCT 
25851 TGIGTAAATC AATTCAGIGC TIGCCCITAA IGTCICATCC AGICIGATGA 
25901 GAGATGrnr GTGATGAAGA AGOGmTAC TATGnTCIT AATEATGIGT 
25951 CITOOCIGIT ATCTCTITCr GAOOGAGATT Al'l'lTiAACA ATAAATTCIG 
26001 AAAACTAAGA AAGIGAAAGC ATAAAATATT GTCITATAAA ATAOGCCAAG 
26051 GAAAAAATGA CACTCCATIT GAAATATGAA AAGTEAGCAT GAAGACIGGA 
26101 GAAGATGAAT GTAGAGTCAT GIGrXGCTEA GAAATGIGGA GAimTCIGA 
26151 GAAATGGATC TTEAGQGAAT TnGTGATIG TCGAAAGACC ATAGATIGTA 
26201 CITGGAGCCr AATTGGIQGA GCCTACTATA GACTAAGGCT ATATGGGATA 
26251 OCCrAGIACT CCTAGGCTAC AAACCIGTAC AGCAIGTIAC TCTACIGAAT 
26301 AGIGGAGGTA CCIGTAAGAT AATGGEAAGT AITIGrGrCT CCAAAOGEAG 
26351 AAAAGCEACT GTAAAAATAC AGEATTACAA CCITAGQGrA TCACIUrCIT 
26401 ATATGIGGIC TGITCTTGAC OGAAATGACT ATGCITAATA CCACIGAACT 
26451 GTACACITAA AAATGGITAA GATGGITywr TCTATGITAT GTAIGnTEA 
26501 TAATAATAAA AAAATIGAAA AAAGCATGAA GATCrmCT OGGAAAAAAG 
26551 AAAAAGAAAG AAAATGGATT AGAGTGATGA GAATATTIGA AGEAATAGAT 
26601 AAAGTCAAAA ACAAAGAAAT GATCTTGCCT TTGAACnTC TIGmAAGA 
26651 TTOGTACATC AGIGATCAGA CrGITAITrC OCAAAOGACC CTTCAGCIGG 
26701 ATAOGAGATT TCCTGATIGC AGCIGTGCIT AnGGACITA ATAATGITCT 
26751 GGTCATaCIT GIGCGAACEA TAAAATAGAT dTOGGATrC ATAGGIGAGT 
26801 TTGAGAAAGG CITCAATriG GTCAACCCAA ACICAOGOCT CATIAAATGA 
26851 TOGAGAOGGA ACGAGIGCIG GGTCATOGAG AKXXXGITC TTTCrGAOaC 
26901 TCATGGAITC CCITIKrOGC TOGGAQGCrC TQGIGATIGA QCIGCTCACr 
26951 GICICITOCr CCmACIGAC ACIQGGAQOC ACCITATAGG TGAmAGTC 
27001 AAGCIGCnr TTCTGATAGA TGAGGAAACT GACOCCTATA AAAGTCAAGT 
27051 CATAEAOCIT GGICTGGAOC GAOGATTrGG ACirAQGEAT TAacrCGAOC 
27101 ATGAOGAAAA GAGGAAGATA GATTITACCr GCGAGAAGCr CTCIGATACT 
27151 ACGAGEATGA GCIGAAGATT GAAAGGTATC TTCAGAGGAA TAGGAGGTIG 
27201 AITAXATAAA GnGTATTATr AGmTTTOOC CATAACIGCA TOGICIATTA 
27251 ArnTGAITC TACTCATIGA aOGiTIACIT AAACl'l'iAAA GACAATCEAA 
27301 AACITTAAAA GAAOCATOGG TAGGTCAdT GCAAAGTAAG AGGIGGATAG 
27351 GGIGICTCAT GAGITC3^GGC ACCITAGEAT GmTITATAT TACIAATaCC 
27401 CIOIAAAnT GIGTEAAATT CAGOJi'i'i'iG TTGCITATrA TATGITGGAT 
27451 ATACITATCC AGCITIGATG TTAGGmCAT TITAATIGrC TCEAIAAACA 
27501 TATCl'iL'iAT GAATAAATAA CCAAGATGAG dTATGTGAC TTAAGIGIGT 
27551 GITTTTAGIG CEAAGEATAG GATAGCITIA TATnOGnT ATEEAAAGTG 
27601 TGIGCIOGGA TCTCCITIGC TAGGAACIGC TGGGEAAGAC ATTGAOCTIG 

27651 occiGTGrrr (j]:<znx:^s:pc^ gggcitcitc tgocactatg cigateitea 

27701 TTCETCGAGC AGi'l'I'lTiAT CTIAAACriG TCAAGAAAGA AACITEEAGG 
27751 TCAOCCGAAA AQGEGGGGGT AAGEAAAGCE TGCAAITICC CCGATEATEA 
27801 GEIGirCrrC GAACEACTEA GAA3AAACEA GAAAATAGAC ATAGTEGAGA 
27851 AAAAIGAATC AATGEACAAG AAOGAAAAAT GAAAAATGOG CEA^S^CTET 
27901 CIGGEAGGAG AGAAAGQGGA CATATTECIG AAACECAAAT GATECEACEE 
27951 CAAAEATCAA ATATCCEGIG TTGAGTCEGE CATACMCTC AAAIAGTAGT 
28001 AGCCTEECGC ACAGAGAGAT AIGCErCAGG GAAAIAGGAG TGTOCAATAC 
28051 GAAGCEGCIG TTGIGCEATC OGTGGAAAAT GATGGAAGAA aGAATEAGGC 
28101 TCCCEAGOGG TGTEATGGAA TAATEEAAAE ATTnGGIGA TGGEEGrEAG 
28151 GEEEGGAAAG CCAAAOGAAA GATGEIGCEE TRii'iTiOX: TTOGATAGEA 
28201 OCEGTEGrOC CEGGEGEGGA CEAAGATGGA GAACAGAAOC ATEGATOGEE 
28251 CEC?EEAAOCE Ci'ilAGATAC AAAATACAGE CEEATEAAAT TAGAGAGEAC 
28301 AEATTECETT TCGATAAGAC TACEATAGAA AGAAATGCEA GAAATAATEG 
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01 
8451 
28501 
28551 
28601 
28651 
28701 
28751 
28801 
28851 
28901 
28951 
29001 
29051 
29101 
29151 
29201 
29251 
29301 
29351 
29401 
29451 
29501 
29551 
29601 
29651 
29701 
29751 
29801 
29851 
29901 
29951 
30001 
30051 
30101 
30151 
30201 
30251 
30301 
30351 
30401 
30451 
30501 
30551 
30601 
30651 
30701 
30751 
30801 
30851 
30901 
30951 
31001 
31051 
31101 
31151 
31201 
31251 
31301 
31351 
31401 
31451 



TmrOGAAT 
AGGCriGAAA 
PGTCATnAC 

crracriGGA 

TIUIGAOCTA 
CEAGAGOGAT 
C^TEAQGAOG 
AAATGCIGAA 
TGIGTTmTA 
TGTIGCAAC?^ 
ATCA.TATCAT 
GATIGIAAGC 
TAAIAATCAC 
TTACPXATT 

crrrcrrcAA 

TCAGATATTA 
AIGIAAACAA 
AACAGAGTCT 
TTCITZATGA 
TCCPCCAAAT 
TATIO^AAAT 
ATTOGAATCT 
GAGIAATATC 
AGGATOGTIG 
OCAOJITriG 
TIAOGAGCAG 

TiTocccrac 

TITjCATTCrC 
GATTTAAIGT 
AACAGAGGCA 
GATCAATGGT 
AATAnriGC 
ATA!3OTTIT 
AATATAI30V 
CrXATAGCTA 
OCTAAATCCT 
lUnCGATCC 
CIAOCXX?IA 
AAACICT3AT 
GAGAATGTIA 

TiTcmcrr 

AATTAAGITG 
GICia3G3AT 

TcrrmGCA 

TCTAAAATAG 
TCATTGOQCA 
GCTAACPCTA 
AAACITACAT 
T3GAGITCAC 
TATATTTEAG 
IGAGCEACTA 
ATTmCAnT 
AOCTATCrrC 

ACCATmrr 

AGAGATACAT 
AGATGAAAAT 

AACmrCAA 
TX^TtJTCATTr 

TATATiajrr 

ATUEATAGOC 



aaogaaatat 
agaatattk: 
cmoaGATG 

CX:AGCAGdT 
TTnTACAGC 
TTITCAGnT 
TOAGIGAAA 
TnAAGTATG 
GTATTITATA 
TTATATGIAT 
TCIATEAAGC 
ATATGCTATT 
ATTCTCTAGA 
AOOCJrATTOC 
GTIGAACCIG 
CTAGGATIUr 
AGAATAATAT 
CTAAAAAACA 
CTCIGAATIA 
TT3GA/02AT 
TOCAAGCATC 
OOTACAAGT 
TATTCATAOG 
AOGAOGCAOC 
TGEATCIGIT 
CEAGAGGAGT 
AATAAOQCTEA 
OJICrOCTCA 
AGAACATGAA 
CTKTCrCAG 
AAAGGCATGr 
GAAACACCAG 

o^^crriGiT 

GIAAATGITA 
OGQGCnTAT 

aacioxt:?! 

CnGTCATAC 
TCATTCAGAA 
TTUITIGATA 
TTGCEATATG 
ATCTAAGCTA 
CACACTCAAA 
GraOGCTOGA 
AATTTAAATT 
AATATTTAGA 
AATimATC 
GAAnAGITC 
TTEAATAGAG 
03AAGAAATG 
AAAAT3GATT 
TAOL'i'i'rX'iT 
TAAAGOCAAT 
TTAmGAAA 
TGACrCATUr 
TGAATGITGA 
TCITCATAAC 
ATGATAGCTT 
TTTATnTCT 
'I'l'ITi'lATTA 

GTOGATGAAA 
TATGITCCAT 
ATCATOCTGA 



TATCnrCAC 
TTACTGAATT 
nUIGGGGAC 
GCKGGAAAT 
AGIGIGACCT 
GACGAATIUr 
AATGnTAGT 
TGATAAAATG 
TUIT^GGrATG 
GGACACACAC 

TTrroGmT 
crmriAAC 

TTATIGAATC 
ATCATCTTOG 
TAGGnGTAT 
AAATIGACIG 
OGCTCAATAT 
AATTGIACEA 
GAGGCnTAA 
OGCACrCATT 
ACIAACACAA 
TATACTCCAA 
AAATAACAGG 
TGCAGAAGAG 
TAAAAIAAAA 
GK¥iGAIGAT 
ACIGCATGEA 
TOCIGCAOOC 
ACAAAAATGC 
TCITCCACCA 
CrrAGGAACT 
TCIAGfOGAT 
TGIGACATTG 
AAAGATCEAT 
TTCAATCATA 
TCIAGAAAGC 
TTITGICATr 
TmTAGAAG 
GATmTATT 
TTTATGEATA 
GATnGATTT 
GCATrmTAT 
AAATOCATIC 
TGAAAOCTAA 
ATAATAGGAT 
ATACEAAAGT 
AGITEAGCIC 
TGTIG^^GIGG 
CACAOGATGA 
TAAAAATTOC 
TCAAATCATA 
TATGAAtGEAC 
AAAATCIGOC 
AAAAAGEATA 
AGAGGATCCT 
CAGGATTAAA 
GK3CIGATEA 
mTTTCATTr 
GAGAAAGIGT 
TGIGATTTTT 
CAAAATTIAA 
rXTGATTGIT 
AAATAATrOC 



TGCTEAATAA 
ACTCIGAATT 
TTCAGGATAA 
TITAACICrA 
TGGCAAATTA 
GAAATGAGIA 
IGCTGCnTC 
TAAGGOCIUT 
TACATATATC 
ACACACATAT 
GITIGCnTA 

ciGCTGrnr 

TTmCIGTC 
TCTACTAAAT 
CTCTCCACVG 
ATAGGTEAGG 
ATAGATGAGA 
AGrAT3GnT 
TTITGGriGr 
ATAATIGAGT 
GGAAAAATAG 
AAGAIATTIG 
AAGATTOGAA 
O^AAATGACT 
OGIGIGGIGC 
GAAGGIGEAT 
ACAATGATGA 
OGIAAAAAAG 
OGrOGIGGCA 
AAGTGICIGG 

GirrGrGriT 

GITATTCTGA 
AGTICTGACr 
AnTAAATGT 
GAGCAACAAC 
AICTGGmT 
GAACAATGGT 
AOGACAATOG 
TGATAAAGAT 
lAATGTEATr 
TTAi'i'lTi'iA 
TATTGGGGGT 
AIATGrATGr 
GTEAIATATA 
GATGEAGAAG 
TGl'iUriTiA 
ACAGAGGIGT 

TIUriGCIGA 
TACTGAGAGG 
GTAAGATCAC 
AAGEATGATA 
AAATAGGATG 
GAAAAATAAA 
ATAGIGOGAA 
ATTGATATAT 
ATITAAOGOG 
ATT^AGATGGC 
ATATIGGEAT 
C^TGEATIGT 
AATOGAAATA 
AOCITIGACA 
GATGrXATTT 



AGICATGITA 
TTIACCriGA 
TnGGTATCA 
TGrAGTOOGT 
CnGTCCIGT 
CAATTATGTC 
TTATATGTAG 
TGIGGICITA 
GTEATATATG 
ATACALTi'lT 
TAAAATTAGA 
TCAOCTAAAA 
CCnGATTIT 
GAATIAAGTA 
TATTCGTCrr 
CGIQGGCATC 
TTGOGATATT 
CIGIGCrCGT 
GGnOGAATA 
GGATITATGA 

TTiurnTrG 

AATEATGITG 
AGAOGmAC 
GmriCTGA 
AGAnrCIAC 
TTnGGIGCT 
GATAGIACTC 
TAGGAAAGAT 
AAGCEATCAC 



cnGmAGGrA 

AGAGGAAAAT 
TCTATATICA 
TAAAAGAGEA 
AAAAATAADG 
TGAIGITAIT 

craazTGroG 

lOS^TAGA 
TGAGIGCAGG 
CIGEAGTCAT 
GATGIGTAGT 

aaaaGGAOGG 

GICEAGAAAT 
GmOGGAAT 
AAAATAAGIT 
ATnAGGATA 
TTGGrGACAT 
AAGTATAAGT 
Cl'iL'lATAAG 
GEAAAAGITG 
TTGIGTATIT 
GAOGEAGnT 
TTI5GGATAT 
AAGTGGAAAA 
ATAATAAAGG 
ACIGGAaOGC 
ATTIGrAAAG 
TCAATGIATT 
ATAATAAATG 
TAAAGAATGT 
GICTGITICr 
CATAAGIAAG 
TGGGAAAATA 
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31501 
31551 
31601 
31651 
31701 
31751 
31801 
31851 
31901 
31951 
32001 
32051 
32101 
32151 
32201 
32251 
32301 
32351 



G?^TATrT?\AT OCIMATTAT TATGATGATT ATAATTTIGG CATCACAT?^T 
ATAOCAOCTA GAATGAATGT GGAAGAAATG AGTLTlTiAT GGTIAGmG 
AAAGAATCC?^ TTGAAGATAG AAAATGAGAG AATAGAACJiA AOTIGAGAAT 
ATTAAAATAA AGAGCAGAGfi. AAATATOGOG GCAGGGAAAA CAIGIGAGIG 
CTAAGOATIG ATmiGAAlG AAOGATTAGG GGGATIGATG GATCACAOGG 
TAAGTATATG CTIAACmA TAAGAAACIT OCACATAGIT TTOCACAGIG 
TTICTACXAT TnCAITTa: ACGOGIACIA OCnACAACIT CXACIGACIC 
GAGAGOOCTG OGAACATTIG GIGnurCIT TIGGATmA GOCTITCIAG 
TGGGTCIGAA ATOGimCTC ATIGIGATTT TCATTTCIGC TTCIGIGACA 
ACTAATGriG AAAACi'lTiC AAGICTTTAA TGOrCACTCA TATATCnCT 
TTICTGAAGr GIUIATTCAA ATCmTQGC CATmTAAA ATnAGGITA 
TGIGlTl'i'iA TIGGGIATrr GTASAAGCTC TTTAAATATG GATOGATGIC 
CAGATIGCCA ATAIATTTTC CGAGTCTATG GIATGGriGC TTATTITOGr 
AAAQGICTCT TAATTACAIG TnUIGGGGC GAGGTCAOCA TAGCTCAAAG 
TrnGCAATT TATGTGriAA TGAGATAAIA TTAATCAGAG TCGTATAGIC 
AAAATTAAAT GmTGATGr CCIGGGCOGA TATAGGIAGG ACIGGATGAT 
CTAAGGAAGA TGGAAAAAAA AAAAAACAAA AAAACAAAAA TAGTACTIGG 
AAAAACITAT TTIAAATTAA ACA {SEQ ID NO; 3) 



FEATURES: 

Start: 

Excn: 

Intmn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Excn: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exan: 

Intrcn: 

Exm: 

Step: 



3000 

3000-3118 

3119-7452 

7453-7543 

7544-8039 

8040-8155 

8156-10894 

10895-10968 

10969-11437 

11438-11530 

11531-16047 

16048-16129 

16130-16215 

16216-16298 

16299-16408 

16409-16467 

16468-17301 

17302-17577 

17578-17709 

17710-17789 

17790-19073 

19074-19174 

19175-20904 

20905-21029 

21030-26649 

26650-26794 

26795-27670 

27671-27768 

27769-29273 

29274-29372 

29373 



CHRCMDSCME MAP POSmCN: 

Chranosome 12 



ALLELIC VARIAmS (SNPs) : 

nsiA 

Posit icn Nfajor ^4inor 



Dcrrain 



Protein 

Position Major 



Minor 



1386 
2594 
2757 
6107 
6392 



T 
T 
G 
C 
T 



C 
C 
T 
T 
C 



Beyond 0RF(5') 
Beycixa CRF(5' ) 
Beyond 0RF(5' ) 
Intrcn 
Intrcn 



FIGURE 3K 



J«N 1 2 2003 ^'1 



Docket No.: CL001010 
Serial No.: 09/776,705 
Inventor: Karl GUEGLER et al. 
Title; ISOLATED HUMAN TRANSPORTER PROTEINS. 



^^94 84 


C 


G 


Intrm 


10280 


A 


G 


Intran 


10297 


G 


A 


Intran 


10331 


G 


A 


Intrcn 


10536 


T 


C 


Intran 


11548 


T 




Intran 


11917 


G 


T 


Intran 


12840 


T 




Intran 


12844 


A 




Intran 


12847 


T 




Intran 


13019 


C 




Intran 


13022 


A 


G 


Intran 


13285 


G 


A 


Intran 


144bl 


G 


c 


Intran 


15464 




G 


Intran 


15469 




A 


Intran 


15545 


T 


c 


Intran 


16199 


T 


C 


Intran 


16798 


T 


c 


Intran 


18103 


C 


T 


Intran 


18421 


A 


G 


Intran 


18528 


!3 


A 


Intran 


18722 


T 


c 


Intran 


18775 


C 


G 


Intran 


18951 


T 


c 


Intran 


18974 


T 


G 


Intran 


19540 


A 


c 


Intran 


19841 


G 


A 


Intran 


20170 


A 


C 


Intran 


20343 


T 


C 


Intran 


20519 


G 


A 


Intran 


20963 


T 


C 


Eton 


21840 


G 


T 


Intran 


22783 


C 


T 


Intran 


22787 


G 


A 


Intran 


22825 


T 


C 


Intran 


22967 


A 


T 


Intran 


23248 


A 


G 


Intran 


23764 


G 


T 


Intran 


23765 


C 


T 


Intran 


24432 


A 


G 


Intran 


24538 


C 


G 


Intran 


24693 


T 


C 


Intran 


24819 


C 


T 


Intran 


25743 


C 


T 


Intran 


26044 


G 


C 


Intra)n 


26555 


G 


A 


Intran 


27886 


A 


C 


Intran 


31884 


T 


C 


Beycnd 


32229 


T 


A 


Beycnd 



411 



Ocntext: 



im 

Positicn 
1386 



ACXXATATGGATGICnACTTCmTTUI^^ 

TmTGIAIATAGATITAaGCTt^^ 

TIAAAACAC[GK?IGCmTITrrn7^ 

AATGACmTX^^GIGmTATITGAACra 
[T,C] 

CrmTTIGTATITITCTriTn^^ 
TCIGICmO^TTITATAAGIClTC^^ 
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ATmrncmriTiTAA^ 

TTITATATIXXTITATAATGITITmT^ 
TUmOW^TUICmUITIT^ 
(SEQ ID NO: 24) 

CIt3AACTITC[TITCITAC^^^ 
TACTia::iAAACITCATCXrrcAOCT^ 

ACTEGCTAGATIUIGOCTIGCI^^ 
CTTTTAAAACATC^rATCXXACTCACAAATAOT 
[T,C] 

TTAAGATTIUITITITITimVTA^^ 

GCACIUmTAAAmTCTCCrAATITAGC^ 

TTCnCCirTCTTCrAlTOOCT^^ 

GGAGmUITTXTIOCTQQCTITO 

ACITATTICATTIT^TAAAATmTAATGAT^ 

(SBQ ID NO: 25) 



% *A 



2757 TTAriGCTAGTACiAGACACTO:?^ 
TACTITIGIUITCXJmcr^ 
GAAGACKXTOXATGirmAGATIT^^ 
CATITIOVTIATAGATAQGGACTCm 
TCrmxmUPriTGCATITCTIO^ 
[G,T] 

CrmTGITMG'lll'iGAGGAJniLU'ilOCTin^ 



CTAGATCATICATATICACITATITCA^ 

AAATO^TarATGGAACTGAGAAATG^^ 
(SBD ID NO: 26) 



6107 GITra?IGIl3Cr[UITrCTATCm 

GmCAGATCTCUmCAATAATmTITrCIA^^ 
TTCAGACAGACAGACrm^ 
CTCACTXAAO:TCim:TC0G^^ 
[C,T] 

TOGGATmGAa3aj0CTaxAa3o^cro^^ 

AGTITCAOCmGITGGCO^aaC^^ 

CAGCcraxAAA(:?iTxria3GA3^^ 

ATC^TAGAACAATCTICAATIAT^^ 
GIOTAAAAlTTTCAAAaXJ[C^^ 
(SEC ID ND: 27) 



6392 CAGOCrinriGAGIX3Cia3GA™^ 
TTICIACICAGftGACGAAGITr^^ 
ACJITATCmaXACCrcAGCC^^ 
TG3CCTCTAaG^mATATTAAI70^CA^ 
TITICATGIAGGAAATGin:r[3\A^^ 
[T,C] 

AmC3^TAGIXI3AGCATITmTATAAAAACAAC^^ 

AAAAlTXAATa3CAGGAGGAaGCCri^^ 

AGGlXriJ[AaOC[CAAAGGOCX:ro3^^ 

AACEAAGnGmaVTCTICITO:^^ 

ATA^riGACATCriAAAGITITC^^ 

(SBQ ID NO: 28) 



94 84 GCAACATTmTATCACAAATATCTXTCIT^ 
TCACACAGCXATirGfiGOGCTAAGC^^ 
TIX3OTX^CIUriTIAIGIATAAm^ 

citi^tataagogat:?ito 

[C,G] 
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CTOXITIO:TmTI7\ATmTAC^^ 

ATCAATAGAAATAGGAAAAACITOGCXrm^ 

AAGAATUITOCACITm?W\ATCAaa 

GAATITOCJICAGAAO^ATITijy^^ 

AGCOTGACIQC^TTAAGCTITCTCTC^ 

(SEQ ID NO: 29) 



10280 AT?\AGA(:i[rX?^CATAGC[ACAGa3COTAT^^ 
Tir^TAGTATIGAITICATiaaG^ 
ATITITIAAAAATAAACAAAGATTy^AGAT^ 

Tia:r[OCTa:7^TiTATmaTi^^ 

CTIO^TTia^AAAAATAOCAATOaTIATA^ 
[A,G] 

TCCTAAGACC?OiAATGACICIGAAT^^ 

GIQOCCCATCOTAGCITICICra^^ 

TCTrACAAAGTAOTrajrAATCT^ 

CITO^ATTCTAAGiraGATO^^ 
(SEQ ID ^D: 30) 

10297 (^ACAGQGGITATAAAATTmTAATIOVTaj^^ 
ATIGGG?^TmCX3\AG33ATIAGAICAATIG^ 
AAAGATAAAGATmrmUIGAATI^^ 
AOICAGAiariTCTATCAAGATTOT^ 
ACXT^ATCOTEATAAACmAAAGAAACCATACATaC^^ 
[G,A] 

ACIXriX^TGCAAAAAAAAAAAAAAAAAAAAAAaaGAATIT^ 

TiuiurarmcicmTiATAmi^^ 

TAATCTT^ATOCACAa^VIO^G^^ 
TCX::m^ATITC50OTAAGC^^ 
GAlGIGAATATCAraiAAATATCAIGm 
(SB2 ID NO: 31) 

10331 AAAIGITOmrciAGmTTGAT^^ 

OGAAAGIGIAIil'i'i'i'lAAAAAIAAACAAAGATAAAGAll^ 

AAOGCAGCATItXlXXriOCATT^^ 

aCia3W^TCTiaGATIT3GAAAAATACCAA 

[G,A] 

OGAATITinjrtXXXrATCCTI^^ 

GCmiTCri^mriTACAAAGTAC^^ 

CAi3^CTGCATGCITia30CTIT^ 

T3AATAG0OriTCAATICIAA(:j[r^^ 

Cn^^TGIOriGAGIAAGD^GAIGI^^ 

{SEQ ID NO: 32) 

10536 TACXmTCCrjEATAAACATAAAGAAACO^m 

ATOCITAG 

AITATATATXXAACT3Cr:TGCE^^ 
OCJEAATCIT^Ta^GAaGAT^^ 
ATiariAGATITGAGATIAAacrTI^^ 
[T,C] 

TCAIGIGAATATCATOG?WVmTGATXnA^^ 
GCATTTOCTAATGnarmTACATI^^ 
GIAACATrAAAAAAAAGITAGaGACITC3^^^ 
a^ACTTTAAAGGaiTViUiGiAGAGITCTCAAAAC^ 
ATACITITOGAA^OirilMGAT^ 
(SEQ ID NO: 33) 



11548 Aa:3^TK^TixrnurrjX3^^ 

TCTITia3GCTIAIAATITC^ 

ATmTATGACAAAAATAlll'lAfiTICATIAAAAC^ 

TIGATIUIMaarcTnGATTmTt^^ 
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AAAATiri^AGCTITIGITIXXATmGAATt^^ 
[T,C] 

TICITAAATAOCJITItjrGIOCr^^ 
CATATAAAAOXTCCO^ATIGAATTCTA^^ 
TITTAATGITArmTAGCATCAATmTr^ 
(SB2 ID NO: 34) 



\ 



11917 Tim^lJVGAGAAAAATTXTACTrATACTr^^ 
TAOCITTIUICKXirGCTITA^ 
ACXrrrcOCMriGAATICrAGAAT^^ 
TTATTATAGCATGAATTmTACTC^^ 
GAGAAAACAGITGAa^ITITCTATGI^^ 
[G,T] 

T:?IC[a3GaGAAGATAGCCAAAT3CTC^^ 

OIACTAGACAGITjrUIUIGGAACmT^^ 

CrATAAACXOT3aiTIGntMTC^ 

TO3XA0C3^GCITGAlACnimT^^ 
(SBQ ID NO: 35) 



12840 GACrATia::AGIAGTUITCIAAC^^ 
AACIUIAATTItJETCTOCAGA^ 
ToCATAAAATICITCAGCATGari^^ 
TX^CACAAGATCTiroiCIAGriUl 
CATATCmTClATLlimTCTATOT 
[T,-] 

ATGATCmTGAATITATO::ATCAT^ 

llAmTITATirax™^CAGmCIATTI^^ 

TTCACAGGCmCTITCTCCmL^^ 

CAGACACACACACACACACACACACACAGAC^ 

CT3:i[UmTT3CTOCTCTAGAC^^ 

(SBQ ID NO: 36) 

12844 KTJX^CI^Gn^GIXJi'lVl^APC^ 

TAAAATTCTICAGCATOCXirm^ 
ACAAGATCTia?IXrrAGITCITCIT^^ 

TcmTcr[ATCTimrcmTC^ 

[A,-] 

TUmTCAATTIATCO^TCATCmmCX^ 

ATTiAiroxxriTiAa^Gmcm^ 
CAoacrAcrricinxxriTG^ 

GACACACACACACACACACAGACACA^ 

TcmTiarixxTxnMACLa^^ 

(SEC ID NO: 37) 

AlTIGITCTOIAGATAAACmU^ 
AATIUITCAaCAlOXTITATrAm 
AGATCTIOJIXrrAGITCTIUI^^ 
ATCTATCTITATCTATCTATn^^ 
[T,-] 

ATCAATTIATCXmCATCmmarTA^ 

TATICCOCTAAGAGIACmiTITA^^ 

GCBiLCITTXJ[CCXX:TT3AC^ 

AGACACAO^GACACACACAaOVCACACACA 

ATIGCCXXnXTITO^CrEaJC^ 

(SB2 ID NO: 38) 

13019 CIGACACAAGATCTITCIUIAGITCr^^ 
OCATATCmiCIAlXJillATCTAra 
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CTATCATUmTCJiATTmTCO^TCJ^T^ 
AATTATMrmTTCXXTTAAa^iTIACTA 
TTITCACM3CI7^CITICI^^ 
[C,-] 

GCAGACACACACACACACACACACACA^ 

CTTCGIUIATTGCTCIILCrAG^ 

TTIGTATUIAGIATTnACIUm 

AGCTirraGACTUIGATGGAOCrr^^ 

OGAAGATC^^CromCTCXnTIGIUIGATITI^^ 

(SBQ ID NO: 39) 



13 022 ACACAAGATCITOJIXriAb'ri^ 

TATCTATCTATCTITATCTATCl^^ 
TCATCTATCAATITATOCAraTCrA 

TATPtTrmTimxrcAACAiTmcnAT^ 

TCACAGGCTACTITCniXXIin?^^ 
[A,G] 

CACACACACACAC?^CA:>CAai.CACAGACAGACAC^ 
OCTUTATITGCTarrCITO^CiaJ^^ 

ai7ViurA::?mTGiTAcn^^ 

TTTTa^AOICIGATO^MriGA^^ 

(SEQ ID NO: 40) 

13285 ACTI?IUIUIC?^AACTIXXnXX:^ 

CACACACACACACATnTCTCTCTa^C^^ 

CTAAATAcmjTiarKrn^^ 

CIAAAGGAmTTTTAAAACACTIGAGITMS^^ 
ATITCAGIUIGmCIGIO^CmTC^ 
[G,A] 

ATTTITICATGIAIAAAAATmarrTACA^ 

TAIQGCACAIAATAAGIGITCTGmT^^ 

ra3VIUICrCT3C5^GriGXT^^ 

AAAmXAKMGATIGIGAAAAACTAAAAaVC^^ 

GTTTATGrCAGrCATOrrGAGAAAIT^AC^^ 

(SBQ ID NO: 41) 

14461 mTCGAGCATTTCATAGGATiaXnAI^^ 
GCAIAAGCATAATIAACnriaGACT^^ 
AAAGCATAGOCACIACraCGCTIC^ 
CAAGCOXAGAO^GGIAAGAaXGIGACTr^^ 
ACm:AGAAAIAAraCTCrCACT^^ 
[G,C] 

AGAAATCACilUJATITCTCmiTIA^ 

TOXXriAAAAAATIAACATITACIG^ 

CATIUTCAATrmGAAGACCAAACATGr^ 

TIAATAQCTCGIAAGATIGAATaTIAOCr^^ 

GTICACACIXriGrAAGATITAGCn^^ 

(SB2 ID NO: 42) 

15464 igagitciattitiaa:i[Gaatc^^ 

COWVTOaCIimXTIGAAAAAaGCGIGAI^^^ 
C?]AAlXjr[ATIJCATAIGAATA^ 
CAASAAATCTITCACAAGAiJEAGAT^^ 
AATITIGCXACTTTATATAATira 

[-,G] 

AAAAAAAAAACriUmGAGAAAGAT3GAGAGAA3C^^ 

AATCnA33AAGOCriT3GrriT3ACAAI7^ 

AQGira^^GAGGACTAGATIGGIGCX^ 

T3CA3ATCmXCTIT3ATICIG^ 

TACroaXAAjmTAG?^(:ACTAAGAC 

(SH2 ID NO: 43) 
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TCT?^riTITAACIGAATCTTIT^ 

G™TtnATAT3AATAG?VTATKATTI^^ 

irXXACITTATATAATICCTI^^ 
[-,A] 

AAAAACITCTITCACAAAGATGGAGAGAAG^^ 
AQGAAGOCTITCGITnGACAAlA^^ 

ATCTSCOCrnGATIUIGCOVT^^ 
OXAAGITATAGACACIAACACAaGCACACIGA^ 
{SEQ ID NO: 44) 



15545 PGJOJIX^TGCAlTW^TATrm^ 

CAmTTCATITITIO^GGGAGAAaGCTIGIA!^ 
AGATAATCATIXXTCTATCAC^^ 
TCXIETAGTIAGCCAAAAGGAGAaEA^ 
AAAGATa:MAGAAGCIGTa^TCT^ 
[T,C] 

T3?VCAATAA3TOCjrcr[GAGACT^^ 

GiaxmiUIGCT^GAAAACX:^^ 

Tm^VTCICICAGCIOjOCCAToCL'r 

TAACAGAGGG^GACIGAGrATGOQCTAT:^^ 

GAACTGGAGCTIC?^C1GIAAACTIT3C^^ 

{SBQ ID NO: 45) 

16199 AGAACriGGAAGam3CX:AAATAC^ 
TTCATCITAGAIMraCAAATGIO^^ 
OCmTAAXnCTXJVCATITia^ 
TGAACTAOriGAAGTAAICAGAGCATr^ 
ATQCTCXTTEUIGIGACmCA^OmC^^ 
[T,C] 

TGICIUICACTICIAGfiGAATaC?^^ 
TTOGAATmTTCTKXJO'ITaGC^^ 
AAATATirnATITITXITI^ 

TAcrTO:xiX3CT:?iTiTiT:7^^ 

(SEQ ID NO: 46} 

16798 GIT3GriAGGi\TGAGITITITIGI^^ 
ACTAATATCAAAT?\GITCTKJ[AT^ 
AAAAGAGCAGCXAOjAITIAAAGAITO^ 
Tia:?ICI503CAGACmJI^^ 
CAAIAATCia3CATmGAACAGCA^^ 
[T,C] 

OCAIATICACMGAATCAGaCATICIUI^^ 

AACASjraOGAITAAAGAAGAAATXXTOCr^^ 

lGGACIATCATCAAraXAAAATITAT?\AA[^^ 

AAAAAACILTITGAAGACa^VIGIOVC^^ 

CIOVT3G?\AaaGCAr3AAAGAGaGra^^ 

(SH2 ID NO: 47} 

18103 CATITIAGCAITCTAATIToC^^ 

GAAAGACAGITXTmoriTCCrCT^ 
OVTOriTrAATCT^GAAAGCJ^^ 

TcuriTcioocACx:3vmATa^^ 

ATT3:?ITOTATAAITIT3GCAa 
[C,T] 

QOCTIXI^ATAAGCO^CTIGAI^^ 
CATmCAAGAGATGTGICAATATCrrr^^ 
QGCIAGAAAATCIITCITIUrira^^^ 
TIGAACmariGATATTiaitTOa^ 
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{SEQ ID NO: 48) 



ATCICATITATAGTCAGC^^CXr^^ 
GAATATCTCIU:illUIlGIUITAlllG'lAC2^ 
TCTIXXTOTICATOGICTAra?^^ 
TOITIGQGftACAAAGGATATGAACTC^^ 
[A,G] 

mTmT?^TAATAATrc:TO^CTrCTI^^ 

GACTAAATAOCAAAACTITCaXAT^ 

CnTIOy\£7rATITIGAlGI^^ 

AAGCrK^i^^TTCATCrrraXAGC^^ 

CAOCTAGraO^ACTCXrraGCAl^^ 

(SEQ ID NO: 49) 



18528 ACAAGAGATGIGIOiATATCrrGrCUITI^^ 
AGAAAATCTIGITIUITCOOiTCfVTt^^ 
A£OTUnGATAriTCCIT3GGAACAAM^ 
COTmAG^GAOGATATTATATAATAATICI^^ 
TCA:T[CIT2ATrC[CACI^^ 
[G,A] 

AAATirGAGAACACrrrTGAAGTATTri^^ 

GCAGTIACIXXAGAAaCCTSAG?^^ 

Tim^AA^rmrCAGCTASTIU^ 

TIUmrcCTCATTIAAAC^IO^T^^ 

CTOirinVTCITIATGITAATTAlA^^ 

(SEQ ID NO: 50) 

18722 TATIATAIAAIAATICICAACTIXrr^^ 
CACIAAATAGCAAAACirrCCCCATA^ 
CTITCAACIIAriTIGAIGIUI^^ 

CMGIAGTClGGACTCrraaC^^ 
[T,C] 

TAAACTCATCTCATIAIGAAAr^^ 

TGITAATmiATIXrrmTIO^GI^^ 

TUriUI?^GaGAAA3GATIGnTnA?^ 

GAATCXnX3^TIAGCrCTCIAAGACA^ 

GTCTIOm^AITAGCAAmTAGAmT^ 

(SB2 ID ^D: 51) 

18775 TCmrOX^^CIAAATAGCAAAAC^^ 
GAGg^OOTICAAGmTrTIGATG^ 
ACIOCAGAAGOCrEGACMrn^ATCTI^^ 
AGCITITC?OGI?^GIUIGGAC^^ 
TCUrcxriTAAACTCAICI^^ 
[C,G] 

ATCTTTAIGITAATIATATrc™ 

TOGAAAlGAAlXXriOVnAGCICL^^ 
TAAATATGICITCATGAITAGCAAT^^ 

{SEQ ID NO: 52) 



18951 CAGAAGCITTICAGCTAGra^^ 
Cl'illOClCAITITWisGIOVICr^ 

AT?O^CTGITCrTCTIUI^^ 
AAAAAT3GAAATGAAlXXr[OmAGCIUK^^ 
[T,C] 

CTACTAAATATGICITCATGATI^^ 
AGia3GIO:XI33^GAAAAATO^^ 
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{SEQ ID NO: 53) 



ACT0CTO3CAGrA:TrACITKXr^ 
CATrAT:W\ATCKAAAAGCTITCT^TCJ^ 
TTCITATTCAGIXI3GCAAGCTmCl^ 
AATG?VTIGmTIAAGACr[rM33VC™i[G^ 
AGCICICTAAGAC?\AATITAAATCAG^ 
[T,G] 

AGCAATAIAG?VTATACTITITmTmTT?^TI^^ 

AGITrAAAAAACAAAGCIT3GIGriUI^^ 

AAa3GIGrCAAAT^TTia:ATCAa3GGG?^T^^ 

TOC^XTACUTAAOCTrCTATa^IAa^^ 

GTIAGGCTCGinriTCAari^^ 

(SBQ ID ND: 54) 



19540 aGmT3ATITITAAAAATIGCOT::TAT^^ 
AAmTACTATITIACmTIGIOVCITGAC^^ 
AAC?^AGITAGAAGAATnT:riX^ 
ATTAITITOCICIAaGGArAGAIl^^ 
CTACAACirGATCAGAATCMT^^ 
[A,C] 

OTjCCAC3lCAAATAGC?^GGGAAACI^^ 

a^CTCAAGICAAAIUIGOGAGCTraaG^^ 

mTmtJTTATACIAAATGATTira 

AAG?^TTXACC?VaCAAAAAAAA!:?IGACmia3^^ 

CTCIXriGGCATCITAACAGr^ 

(SEQ ID NO: 55) 



19841 CIQCO^CACAAAIAGCT^GaGAAACTGCCftQ:?^^ 
CCAGIOWJiX:AAATXria3GAGC^^ 
TA.TITIUITAmCTAAAIGATTia^ 
AAGAAia3^a3Va3^AAAAAAAGTCACIAT3aa3^CA^ 
CTCTCTOXAIUITAAGAITCT^ 
[G,A] 

COOVIAAACAGAITICA.TOGAATCACA^^^ 
PGIUIATCAKmmAP^ 

TiTKnTACJX^cnxx^mN^^ 

AGCS^TIAGGaCAaCATriGCTIAaU'lll'^ 
CAAACAIGIGCTITAGAITOaaCAI"^ 
(SEQ ID NO: 56) 



20170 mriGACXIDaGIAGCAmiGmA^^ 
TAAACCTCTITITAAAGITITI?^^ 
GCAIGIATGACAAATIGXa^^GGAAAAG?^ 
TTITICIATTIGACTAAIATTIG^^ 
CITITIGCXJI?nGIt3^TAATG^^ 
[A,C] 

CTirriGICK^TCTIGAGGIGAAA^^ 

TTAriTGIGCTIGGAGIXXT^ 

GAGIAGma:mcm9GAGAaC3VTCr^^ 

ATACTIATITIAAIGAGITIOmy^ 

GITITAGIGia:nGAiaGgOXA^ 

(SBQ ID NO: 57) 



20343 iwjnTnTrrcn^Tnx^^cr^^ 

GGATTAACiUlllGCm^nUIGAAATAATGAAI^ 

TITCACT?^C[TITITjrxriGAT^^ 

TTXTCTIGrmTTIGIGCTIGGAGI^^ 

[T,C] 

GTmcnATAcrmrrrvAATO^G^^ 
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TCATTCATCOT^GAGTITAa^ 
TAOOGAam^GAGaiaCJIMGCI^^ 
CC[TCAACAAGAGCAaCTia?ICI^^ 
[SEQ ID NO: 58} 



20519 GATATITIUITOTATm^IXXTraGAC^^ 

TGI^VOIITACITATACITATI^^ 
OCAAAAAmTCiaiT™:^!^^^ 
AGGAATCATTO^TCCTAGAiJITrACAC^ 
[G,A] 

GAa^AoaGACAaaaGAGCTa^rAGG^az^A^^ 

ATCXrCITCAACAAGAGCAariTGC?!^^ 
AGAm^TITATCACAATGIAAATGAGCIT^ 
TIXAAa3ITATC70:TIO=^3G^ 
AGCIX5\(30XAAGAaGAGAaC?IO^ 
(SH2 H) NO: 59) 

20963 TX^AGCTIGAGAOGCICirrATIT^^ 
CIXrrrinTTCATTIGA^^ 
GOCraGATTOCO^GGCrrAGGIU 
CATAAGIGAOriGiariGATnGACA^ 
AaC?^GAAGIT3AAGATt3^TmCTIX:^^ 
[T,C] 

CTICKXDoGITOoCrTOGCA^^ 

TTOO^AGIAAGIACATAAGAC^^ 

AT:?IGITCTA0CTIOOTTIt5^ 

GATATTATTTTCCrATAGGATIT^ 

AQCAAGIAGITIOOTraGT^ 

{SEQ ID NO: 60) 

21840 AAACAOITATtXri^VIUmTC^^ 

AOGio^ocxxrioariaxATnGiCATia^ 

GCiaJD^GIGAAGATTTOOQCIGI^^ 

GAAAAAAATAA£J[t^aO^GI?^CX30^ 
[G,T] 

aCAIATGAAGACTATITCTATRATGAGACTT[^ 

AAAIATATITraGCACMTCXri^^ 

GTOSCIAATATITITOXJITA^^ 

ATGACATACACATAGCITIAGCCIAAA 

ACmTAAACAGAAGC?IAIA03ATAGaCT7\^^ 

(SEQ ID lO: 61) 

22783 TGAGAAAIAAAGCACIGATAIAAATCIGA^^ 
AGATCCTAlTAGAACaW^TIGfta^^ 
TGIGCTIOVITATGITOTTCCA^^ 

AGAGIOm\AGIOQCia3GACTT^^ 
[C,T] 

GiraiGCIT3a3AAGAAAAAT3GICIT^ 

ACiaxrmCIAGCAAAAGCATA^ 

AGCIGOGirGAOGATCTOaGCIAAAIGAACO^^ 

AGAT3Cr[GACAGAGAGia?ICACOTaD^^ 

AATK^^TITITITIUITIAC^^ 

{SEQ ID NO: 62) 

22787 AAATAAAGGACIGATAIAAATCTGACOV^^ 

GCX^^TIAGAACOW^ATIGAOCATAAGAACOO^ 
CTIOVTmTGmTITa3VCTG?V^ 
CXr[CACajrT3CAATC?ITACAT^ 
TCJmy^GTOCUiraGACITGGICAT^^ 
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[G.A] 

Ta:TIX3aGA;O^AAAAIQOTriTC^^ 

OOCJITCAGGATCiraaCIT^^ 
aCJIGAO^GAGAGTGGICACTroOJIGAC^^ 
GAATITITITIUITITO'AAAATGATAAAA^^ 
{SEQ ID NO: 63) 



22825 cmtagigictaaacatiagatoxatt;^^ 

AGAAAAATGACTAACKXTtJTCCTIC^^ 

OGGACAS^^aairGAGCIUIGITO^^ 
[T,C] 

TTCJIUnGAAATOmXAAACIGCC^^ 

tacatcaaggatagacagagatogk:^^ 
tcaaijiotcigaagciaaat^^ 

TOrXXTITKXTIGITrATIT^^ 
{SB2 ID NO: 64) 

22967 COOVCOITIO^yVTGITAC^ 

OTXTia3GAAGAAAAATaC?IUITCA?^ 
GCUITITC:iAGCAAAAGCATAGACAC^^ 
TOOaiTCAGGATCTOaaCTAAATCAAa^^ 
[A,T] 

aGIGACAGAC?OI3GIOOTIXXI^^ 

CAAITITITITCITrA!:^^ 

a^TAAOTAGOGCIOO^LTTITCA^^ 

TGACACJITIATITCATTrAAaGAACTCT^ 

CIXXACACATIAAIAA3AAAACIAACAAAAC^^ 

{SEQ ID NO: 65) 

23248 CATOM^a^TACMJ^^ 

ATIO^llTill'ilL'i'IlACIAAAATCATAAAAiJilb'llATTS 

GACIO^CAGCATOGITCTOCIG^ 
TTATTIACITXniTCAmTC^^ 
[A,G] 

TIAAAATCnT^TO^GCITIAQGCAT^ 
GCIUTUmUITAGGftGCnG^ 
ATrnTCTTTICITITITCO^l^^ 
ACIGCITCITOXITIGITCAACAA^ 

{SEQ ID NO: 66) 

cnAT3CTToariTmT^ 

AATCX:TAATaGAmTATAATGAACACACAC^TC^^ 
AAASATOCCnAGCTACiU^UAITICAlCIT^^ 
[G,T] 

TCTIO^TaXTTCITITGAAG^^ 

CCTIGCAAAATCAAAC^IGIGAGCTA^ 
AAGA(:?]T3^GAACI5^AGACIC^ 
GGGCXXTTGCATCAGAATAGAGCACarCIA^ 
(S03 ID NO: 67) 



23765 AAAIGAATIAAOJCAGIGrriTCACnT^^^ 
Gra3aGmTCAaiTATaGAATOGCAa^^ 
TIATCCrirariTmT02ACT[^^ 
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AAGATOOCnxrT?VLl'lGlATITCATCTm 
[C,T] 

CrATCAAOaZAGia^^GCAGIGA^^ 

CTOIAAAATCAAAGIGIGAGCIAa^K^^ 

AGATTTAGAACTT^AGACirrcACI^^ 

aaaxxr[a:ATCAGAAiACAGCA(XxxrT^ 

(SBQ ID ND: 68) 



.?4432 GGATO^IGCTa^GGAacrrOCrTCy^CXXAC^^ 
GACTIUIGICA!:7IAAA3AAAAA^ 
OGAATCAGOCACTa^ICiarrTAGGAAA^ 

TAOGcio^cmcrcAaQGAaGa^cmAAG^ 

CTITTCCACCACCAaACAAi^^ 
[A,G] 

QGAATOIACTOGAAAOCACXnTCAaiT^^ 

TCATCrc?^TATCATTM30C^ 
TIX:AGAaGCCX:TOGACATATAGGA!3acri^^ 
GAATGCAOriTITrAACAAaiGACTA^ 
(SH2 ID ND: 69) 

24538 Q^T^AOGCITGCJiaX^TCAGCCAGira 

aao3criTia?ia:TAaGcr[o^ 

GmcrirariGAiuriTiravaiAC^ 

OTOTracix7r[a3aGAATtx^ 

TTaJCTATIOOWO^GGAAGAAGCil'IUGC^ 
[C,G] 

OAIAAACTITClGIGATCr^^ 

AGACTOZTTIOVCATK^O^C^^ 

CAGCTIArKS!LCriGAAIQCAajri'^ 

CTCIGAAAAIGAGOCrmTATCIO^ 

AAGraCTCrCOXACIAIGIAmTACOm^ 

{SEQ ID NO: 70) 

24693 ocmnAOCTOGCinxnrTiT^^ 

TTCAGITCriGma^AAITri^^ 

mcrocAAOiJPiciAacTATmTK^^ 

AAAIGnAATCTXTCiaOGAGOCAGGfiGAC^^ 

AOGACiaxrrciAAax^^crci^^ 

[T,C] 

GACIAAAAAACAAACnilimCTATIUIUI^^ 

TCIGTriAACACTSTCAAACAAATIAAG^^ 

TIAITIGIAAGCCrACIAATK3GA0C^^ 

ATCATAAIXJ[AGAATIATAaaCT3C7ra^ 

GITIOJIITIGAGAATISVjITIOVI^^ 

(SEQ ID NO: 71) 



24819 AACnJCTCTEAOCT^TmTira^ 

TAATCITICia3GAGaCAaGAGACIG^^ 
GCCICIAACIt^^CICCAACTCAGCT^ 
AAAAACAAACToIGACTA-TICICIGAAAATT^^ 
TTAACACIGIt^^yyOWVTTAAlG^^ 
[C,T] 

GIAAGCr:mCI7VOT3GACCAGITri^ 

ATGTTO^TmTAQGCTOCTCAaMAACAATAT^ 

TTITCAGAATGAGmCA.TAATGIT^ 

TTUmAACmTITOCTO^GACrLAl^^ 

AGOnAAACAGATarrnTOTITITGG^^ 

(SBQ ID NO: 72) 



25743 TATCx:A:j[m:MCAaa3n7\^^ 
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CGOCTACITATTKmjroCI^^ 
GATAAGATIOCnTCTATTCAATTTA^^ 

TAi:^TTmCAAlCTATIGATA3aGAGAGCIAP^^ 
[C,T] 

AAAAAO^ICAAAATITIAAAATACO^TITIAAAATT^ 

GCAAAATTAAGGAAAACCIAGATICATAAAAATIOT 

TC7V:?IT3CIT3GCUIT^^ 

GITITAinATtTITICITAA^^ 

TTIAACAATAAATTCT3AAAAaAAGAAAGIGAAAG^ 

(SB2 ID ND: 73) 



26044 AAAAAO^KAAAATTnAAAAmaCATITITW^ 

GCAAAATTAAGGAAAAOCTAGATICATAAAAATT^^ 
TCAGIOrnQCCX:mATGIUI^^ 

GTTiTAirrATamcnAATmT^ 

[G,C] 

GCX3W33AAAAAAIGAGACrcr:ATITG^ 

ATX¥^T7rAGAGICAIGIGIT3C^ 

aGCAATITIGICmGia:AAACAOCA^^ 

Acmmc?^ciAAaGcmmTOGCATAGCcm?r^^ 

{SEQ ID ND: 74) 

26555 A!:T]ACP3:TAaGClACAAACCIGmCAGC^^ 
TAACATAATOC^ITiACTIAITIUIG^ 
TTACAA'XTIAaQ:jIAlCACIGI^^ 

tiaataozactcmctciagactia;^^ 
gttitataataataaaaaaattcaaaaaagcato^c^ 

[G,A] 

GAAAGAAAA10:ATI700T^T3M3AATAT^^ 

GAAATGATCTTGOCrmW^Ll'l'i^^ 

ATITOXAAACX^ACnriTCAGCD^ 

ACTIAAIAATGITCTaGTO^TCCTT^^ 

TG?iGITICAGAAM3CITCAAITI^ 

(SBQ ID ND: 75) 

27886 a^irmriTAAAlGIGIGIOT^^ 

OAGCACJnU'il'mraTAAACITCia^ 
aQaCJ[AA(T[AAACX:TItXAATIT0a3^^ 
AACIAGAAAAIAG?^CATAaiTCA!3Wyy^T^ 
[A,C] 

TOXXnAGAACTITCTOJCAaCAGA^ 

ACTIOWVmiXAAAIATCClUIGITCACT^ 

TCn::ACAGAGAGATATtXTICAaaC^ 

TATai?ia3AAAAIOVTO:AAGAAGGAAT^^ 

AAAmTITIT3:j[CAT3GIT^ 

(SH3 ID NO: 76) 

31884 Cill'mT3GITAGITIT3y^AGAATCX:AI^ 

TGAGAATAGmAAATAAAGAaGAGAGAAAAmra3aa3^^ 
AOGATTCATmTCftATCAAOSVTIAGa^ 
AACirmiAAGAAACITOCAGATAGITTO 
OjmCTACUmCAACITOZACIGACT^ 
[T,C] 

ATITIAGCX:TITC:iAGTaGGIC^ 

GraACAACIAAT:?IT3AAAACrillGAACiriJlU' 

TGAATTGIUmTICAAAIUITITGCC^ 

GmriGITO^AGCICITIAAATAT^^ 

TCTATOGmrarnGCITATITK^^ 

(SEQ ID ND: 77) 
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0 



TITC^VTITCKCTIUIGIGAGAAC^^ 
CAT?^mTCrriCITIT3IGAAGITT^^ 
T^^TOIGITITrATTOaGmTIT:?^^ 

TCTITCT3aaGOC3^acr[CACG?^™^ 
[T,A] 

GAAAAACITArillAAATTAAACA 
(SBQ ID NO: 78) 
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